Draft version March 15, 2013 

Preprint typeset using I^^'T^]X style emulateapj v. 12/16/11 



REDMAPPER I: ALGORITHM AND SDSS DR8 CATALOG 

E. S. Rykoff\ E. Rozo\ M. T. BuSHA^■^ C. E. CuNHA^ A. FiNOGUENOv'', A. Evrard"''^'", J. Had**, B. P. Koester""', 
A. Leauthaud", B. Nord*^, M. Pierre^", R. Reddick^'^, T. Sadibekova"\ E. S. Sheldon", R. H. Wechsler^-^ 

Draft version March 15, 2013 

ABSTRACT 

We describe redMaPPer, a new red-sequence cluster finder specifically designed to make optimal 
use of ongoing and near-future large photometric surveys. The algorithm has multiple attractive 
features: (1) It can iteratively self-train the red-sequence model based on minimal spectroscopic 
training sample, an important feature for high redshift surveys. (2) It can handle complex masks 
with varying depth. (3) It produces cluster-appropriate random points to enable large-scale structure 
studies. (4) All clusters are assigned a full redshift probability distribution P{z). (5) Similarly, 
clusters can have multiple candidate central galaxies, each with corresponding centering probabilities. 
(6) The algorithm is parallel and numerically efficient: it can run a Dark Energy Survey-like catalog in 
^ 500 CPU hours. (7) The algorithm exhibits excellent photometric redshift performance, the richness 
estimates are tightly correlated with external mass proxies, and the completeness and purity of the 
corresponding catalogs is superb. We apply the redMaPPer algorithm to ~ 10,000 deg^ of SDSS DR8 
data, and present the resulting catalog of ^ 25,000 clusters over the redshift range z £ [0.08,0.55]. 
The redMaPPer photometric redshifts are nearly Gaussian, with a scatter cr^ ~ 0.006 at low redshift, 
increasing to Uz w 0.02 at z ^ 0.5 due to increased photometric noise near the survey limit. The 
incidence of projection effects is low (< 5%). Detailed performance comparisons of the redMaPPer 
DR8 cluster catalog to X-ray and SZ catalogs are presented in a companion paper. 
Keywords: galaxies: clusters 



1. INTRODUCTION 
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selfield et al. 2013p. Galaxy clusters are one of the key 



metric surveys such as the Da rk Energy Survey (DES: 



The DES Collaboration 2005), Pan-STARRS 



et al. 2002), the Hyper-Suprime Camera ( HSC and 



the Large Synoptic Survey Telescop e (LSST:|LSST 
Energy Science Collaboration||2012 ) 



Kaiser 
7 



Dark 



Because galaxies are obviously clustered on the sky, 
rich galaxy clusters have been detected as far back as 
the 1800's (Biviano 20001, with the first large catalogs 
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created 50 years ago ( e.g., Abell 1958 Zwicky et al. 



19681 [Abell et al.||1989[ ). More recently, the advent of 
multi-band data has led to a proliferation of optical clus- 
ter finding algorithms. These algorithms use various 
techniques for measuring clustering in both spatial and 
color/redshift space, ranging from simple matched-filters 
to more complicated Voronoi tesselations. These clus- 
ter finders can be divided into rough ly two classes, those 



based on photometric redshifts (e.g., Kepner et al. 
van Breukelen fc Clewley||2009; Miikeraitis et al' 



1999 



Durr e t et al. |2011||Sza bo etal. 2011 ; Soares-Santos et al.| 
2011; Wen ct al. 2012), and those" ut i fizing the cluster 
red -s equence 
2000 



2009 



1999 



Gladde rs fc Yee 
Gal et al. 



et al. 



_^ Annis et al _ 

iKoester et al.|2 007a; Gladders et ai. 
jThanjav ur"^al.||2009( |Hao et al.||2010b| [Murply 
]2012 ). However, relatively few of these optical cat 



alogs have be en utilized for cos mological parameter con 



straints (e.g., Rozo et ar]|2007 



„ |2010; Mana et al..2013) 

Given the above landscape, it is a fair question to ask 
whether the world really needs yet another photometric 
cluster finding algorithm. As we describe below, we be- 
lieve that the answer to this question is yes. In particular, 
there are a variety of important features that any reason- 
able optical cluster finder must have in order to properly 
exploit the photometric data that will become available 
with ongoing or near-future photometric surveys such as 
the DES or LSST. 

What must we require of current photometric cluster 
finders? The key features are as follows. 

1. The algorithm must be able to smoothly detect 
galaxy clusters in a consistent way across a braod 
redshift range. This can be a challenge for photo- 
metric redshift ( "photo-z" ) and red-sequence based 
algorithms alike. For photometric redshift based 
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algorithms, one must be cautious because biases 
and scatter in reported photo- 2:s increase at fainter 
magnitudes where spectroscopic training and vah- 
dation samples can be highly incomplete. For red- 
sequence based cluster finders, one must confront 
the fact that the 4000 A break characteristic of 
the early- type galaxy spectra moves across filters. 
While (7 — r is an ideal color for cluster detection 
at low redshift, one should rely primarily on r — i 
at itermediate redshifts, and i — z aX higher red- 
shifts (and we note that this will also affect photo- 
z-based finders). By z « 1, near- infrared (NIR) 
photometry is required. Being able to smoothly 
transition from one color to the next — or better 
yet, to always use all available photometric data — 
is paramount. 

2. To the extent possible, the algorithm should self- 
train to the available data. For instance, algo- 
rithms reliant on a priori parameterizations of the 
red sequence could easily result in systematic biases 
if the a priori parameterization differs from reality. 
Note that this also impacts photo- z-based cluster 
finders, since there can be unknown and difficult 
to calibrate biases in the photometric redshifts of 
cluster galaxies. 

3. The algorithm should be numerically efficient, ca- 
pable of running on extremely arge data sets within 
reasonable timeframes with modest computational 
resources. 

4. The algorithm must be able to properly account for 
complex survey masks, including varying depth. 

5. The algorithm must allow the construction of 
proper cluster-random points that adequately char- 
acterize the effective survey volume for cluster de- 
tection in order to enable large scale structure stud- 
ies. In particular, it is worth emphasizing that be- 
cause galaxy clusters are extended objects on the 
sky, the galaxy mask used to construct the cluster 
catalog is not the appropriate mask characterizing 
the angular and redshift selection of galaxy clusters 
for any particular cluster finder. 

6. The algorithm should produce a full P{z) distri- 
bution for every cluster. Similarly, given that the 
center of a galaxy cluster can be observationally 
uncertain, there should be a corresponding center- 
ing distribution in the plane of the sky P{n). Note 
that if one adopts the prior that a galaxy resides at 
the center of a galaxy cluster, then the probability 
P(n) collapses to the probability that any given 
cluster galaxy is the correct cluster center. Our 
expectation is that just as P{z) allows one to ade- 
quately recover the redshift distribution of galaxy 
clusters in a statistical sense, so too will centering 
probabilities for cluster galaxies allow one to statis- 
tically recover the angular distribution of clusters 
in the sky, a point that is of critical importance for 
large-scale structure studies. 

7. In order to maximize the cosmological utility of 
the derived cluster samples, the richness estima- 
tors should be fully optimized for the purpose of 



minimizing the scatter in the richness-mass rela- 
tion. 

The red-sequence Matched-filter Probabilistic 
Percolation (redMaPPer) cluster finding algorithm is 
our solution to the above list of must-haves. Concerning 
the last point in particular, over the past several years 
we have empirically explored what wor ks and what does 
not work in estima ting cl uster richness ( Rozo et al.|2009 
20111 [Rykoff et aL]|2012| henceforth R12). i-br instance 
we nave demonstrated that estimating membership 
probabilities for every galaxy is very effective, while 
using hard color cuts to derive cluster membership can 
lead to large biases. We have fully optimized the optical 
detection radii, as well as the luminosity cuts employed 
when counting galaxies. We have also investigated 
whether total galaxy counts or total cluster luminosity 
is a better mass proxy, and whether or not trying to 
add blue galaxies into richness estimates results in 
improvements. All of these lessons have gone into the 
creation of the redMaPPer cluster finder. 

There, is however, one feature of redMaPPer that rep- 
resents more of a personal bias as opposed to an em- 
pirically driven choice, namely, the fact that redMaPPer 
is a red-sequence cluster finder. Indeed, operationally, 
redMaPPer can be easily adapted to work in photo-z- 
space rather than working directly in color-space. How- 
ever, we are wary of reliance on photometric redshifts, 
which become increasingly difficult to characterize for 
faint galaxies due to a lack of spectroscopic training and 
validation samples. Furthermore, cluster galaxies are a 
very particular population, and photo-z estimates tai- 
lored for clusters should be derived separately from the 
total galaxy population. Combined with the fact that we 
have not seen any evidence for photo- z-based algorithms 
outperforming red-sequence methods, we have opted to 
rely on a red-sequence method when developing redMaP- 
Per. Note that while it is true that redMaPPer also 
relies on spectroscopic training samples, our novel red- 
sequence modeling algorithm, an important advantage 
of our algorithm is that we do not require a locally rep- 
resentative training sample: our spectroscopic training 
galaxies can be the brightest cluster galaxies at all red- 
shifts, with no degradation in the performance of our 
photometric redshift estimates. 

The redMaPPer algorithm is designed to handle an 
arbitrary photometric galaxy catalog, with an arbitrary 
number of photometric bands (> 3), and will perform 
well provided the photometric bands span the 4000 A 
break over the redshift range of interest. It is thus well 
suited to curre nt survey s such as the Sloan Digital Sky 
Survey (SPSS: [York et al. 2000) for low and moderate 
redshift clusters (z G [0.05,0.55J), as well as upcoming 
surveys such as DES for low and high redshift clusters 
(z < 1). As a case study, in this paper we present the 
redMaPPer catalog as run on 10,400 deg^ o f photometric 
data from the Eighth Data Release (DR8: Aihara et al. 
20Tl] ) of the SDSS. Wc wiU make the full DR8 redMaPPer 
catalog available after this paper is accepted for publica- 
tion. 

The layout of this paper is as follows. In Section [2] we 
describe the SDSS data used for this work, followed in 
Section [3] with an overall outline of the redMaPPer clus- 
ter finder. In Section [4| we describe the multi-color rich- 
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ness estimator A, which is an update of the single-color 
richness estimator used in R12. In Section [5] we describe 
our strategy for dealing with stellar masks and regions 
of limited depth in the survey. In Section |6j we describe 
the self-training of the red sequence parametrization used 
to detect clusters, as well as measure their photometric 
redshifts, which is described in Section [7] In Section l8] 
we describe our new probabilistic cluster centering al^ 
gorithm. Finally, in Section M we put all these pieces 
together into the redMaPPer cluster finder. The result- 
ing SDSS DR8 redMaPPer cluster catalog is described 
in Section [TOj Next, in Section [TT] we describe a new, 
more accurate method of using the survey data to esti- 
mate the purity and completeness of the cluster catalog, 
and in Section [12] we describe how these methods can be 
applied to generate a cluster detection mask over the full 
sky. A summary is presented in Section |13| In the ap- 
pendices we present several systematic checks, including 
Appendix [O which contains an estimate of the minimum 
number oiTraining spectra required for an accurate red- 
sequence calibration. A full detailed comparison of the 
redMaPPer DR8 catalog to X-ray cluster catalogs and 
other large photometric survey catalogs is presented in a 
companion paper (Paper II: Rozo & Rykoff 2013). When 
necessary, distances are estimated assuming a flat ACDM 
model with = 0.27, and h = 1.0 Mpc, i.e. all quoted 
distances are in Mpc. 

2. DATA 

As discussed above, the redMaPPer algorithm is de- 
signed to handle an arbitrary photometric galaxy cat- 
alog, with an arbitrary number of photometric bands 
(> 3). Of course, the quality of the output depends on 
the quality of the photometry. As a case study, in this 
paper we run redMaPPer on SDSS DR8 data, due to its 
large area and uniform coverage. 

2.1. SDSS DR8 Photometry 

The input gala xy catalog for this w ork is derived from 
SDSS DR8 data (lAihara et al.||2011|). This data release 



includes more than 14,000 deg of drift-scan imaging in 
the Northern and Southern Galactic caps. The survey 
edge used is the same as that used for Baryon Acous- 
tic Oscillation S urvey (BOSS) target selection (Daw- 



son et al. 2013), which reduces the total area to 
10,500 deg with high-quality observations and a well- 
defined contiguous footprint. Similarly, bad field and 
bright star masks are based on those used for BOSS. 

The BOSS bright star mask is based on the Tycho 
catalog ( jll0g et al. 2000). However, this catalog is in- 
complete amie~5rigErend. Cros s-matching Tycho t o 
the Yale Bright Star Catalog (Hoffic it fc Jaschek|[T99l| ) , 
covering 9000 of the brightest stars m the sky (mostly 
visible to the naked eye), we have found an extra 70 
stars — including very bright stars such as Arcturus and 
Regulus — that obviously impatced galaxy photometry 
and detection. We have also found that very large, bright 
galaxies such as M33 cause significant problems for pho- 
tometry in the area, including many spurious sources 
marked as galaxies. To handle these issues, we have visu- 
ally inspected and masked obviously bad regions around 
63 objects brighter t han V < 10 from the New Gen- 
eral Catalogue (NGC: |Sinnott|1988[ ) that are in the DR8 



footprint, as well as the bright stars mentioned above. In 
total, an additional 36deg^ (~ 0.3% of the total area) is 
removed by our combined bright star and galaxy mask. 
After accounting for all the masked regions, the input 
galaxy catalog covers 10,400 deg^. 

As discussed in R12, the careful selection of a clean 
input galaxy catalog is required for proper cluster find- 
ing and richness estimat ion. Our inp ut catalog cuts 
are similar to those fromlSheldon et al.|f2012) used for 



BOSS target selection, with some modifications. First, 
we select objects based on the default SDSS star/galaxy 
separator that have i < 21.0. We then filter all ob- 
jects with any of the following fiags set in the g, r, or 
i bands: SATUR.CENTER, BRIGHT, TOO_MANY .PEAKS, and 
(NOT BLENDED OR NDDEBLEND). Unlike the BOSS target 
selection, wc have chosen to keep objects flagged with 
SATURATED, NOTCHECKED, and PEAKCENTER. 

Particular care has to be made in avoiding over- 
aggressive flag cuts because of the way that the SDSS 
photo pipeline handles dense regions such as cluster 
cores. In these cases, the central galaxy and many satel- 
lites may be originally blended into one object and then 
deblended. However, if there is a problem with one part 
of the parent object — such as a cosmic ray hit that is not 
properly interpolated — then this bad flag is propagated 
to all the children. We have found that removing objects 
marked with SATURATED, NOTCHECKED, and PEAKCENTER 
often mask out cluster centers, while truly saturated ob- 
jects such as improperly classified stars are also rejected 
via the SATUR_CENTER flag cut. Overall, by including 
these objects we increase the number of galaxies in the 
input catalog by less than 2%, and our tests have shown 
no significant effect on the richness measurements except 
for a few clusters for which the cores were inadvertently 
masked out when galaxies with the above flags were re- 
moved. In total, there are 56.5 million galaxies in the 
input catalog. 

In this work, we use CM0DELJ4AG as our total magni- 
tude in the i band, and MODEL_MAG for u, g, r, i, and 
z when computing colors. We limit our input catalog 
to galaxies that have rrii < 21.0, approximately the lOcr 
limit for galaxy detection such that the characteristic 
magnitude error for our faintest objects is ~ 0.1. The 
DR8 iibercalibration procedure yields magnitude unifor- 
mity on the order of 1% in griz and 2% in u. The 
resulting color scatter introduced by the photometry is 
significantly narrower than the width of the cluster red 
sequence. All magnitudes and colors are corrected for 
Galactic extin ction u sing the dust maps and reddening 
law of ,Schlegel et al.| (SFD: 1998). 

2.2. Spectroscopic Catalog 

Although our cluster finder uses only photometric 
data, we require spectroscopic data to calibrate the red 
sequence and to validate our photometric redshifts. For 
this purpose we use the SDSS DR9 spectroscopic cata- 
log (|Ahn et al. 2012|). This spectroscopic catalog has over 
1.3 million galaxy spectra, including over 500,000 Lumi- 
nous Red Galaxies (LRGs) at ^ ~ 0.5 from the CMASS 
sample. As detailed below, we only use ~ 20% of the 
available data in our training, and use the remaining data 
set to validate our photometric redshifts. 

3. OUTLINE OF THE CLUSTER FINDER 
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The redMaPPer algorithm finds optical clusters via 
the red sequence technique. More specifically, it is built 
around the optimized richness estimator A developed in 
R09 and R12. The algorithm is divided into two stages: 
a calibration stage, where we empirically calibrate the 
properties of the red sequence as a function of redshift; 
and a cluster-finding stage, where we utilize our cali- 
brated model to identify galaxy clusters and measure 
their richness. The algorithm is iterative. First, an ini- 
tial rough color calibration is used to identify clusters. 
These clusters are then used to better calibrate the red 
sequence, which enables a new cluster finding run (see 
also |Blackburne fc Kochanek|2012[ for a similar approach 
within the context of cluster finding with spectroscopic 
data sets). These two calibration/cluster finding stages 
are iterated several times before a final cluster finding 
run is made. 

The calibration itself is also an iterative procedure de- 
scribed in detail in Section |6l We start with a set of 
"training clusters" that have a central galaxy with a spec- 
troscopic redshift to calibrate the red sequence model. 
As we show in Appendix [C] our minimal training re- 
quirements for unbiased cluster richness and photomet- 
ric redshift estimation are « 40 clusters per redshift bin 
of width ±0.025. In the case of SDSS DR8, the spectro- 
scopic availability greatly surpasses this requirement by 
many orders of magnitude. However, for upcoming sur- 
veys such as DES probing much higher redshifts, this will 
no longer be the case, and we have developed redMaPPer 
with these limitations in mind. 

For the present work on DR8, we base our training 
clusters on red spectrscopic galaxies. These red galaxies 
are used as "seeds" to look for significant overdensities of 
galaxies of the same color. The significant overdensities 
thus become our training clusters that are used to fit a 
full linear red-sequence model (including zero-point, tilt, 
and scatter) to the sample of all high-probability cluster 
members with a luminosity L > 0.2i*. This luminosity 
threshold is optimal for richness measurements (R12). 
In this way we effectively transfer the "seed" spectro- 
scopic redshift to all high probability cluster members, 
which enables a much more accurate measurement of the 
red sequence. This is especially true for fainter magni- 
tudes where there is very limited spectroscopic coverage. 
Note that because the algorithm utilizes all colors simul- 
taneously, the "scatter" is characterized not by a single 
number but by a covariance matrix. 

Given a red-sequence model, the cluster finding pro- 
ceeds as follows (see Section [9] for details). First, we con- 
sider all photometric galaxies as candidate cluster centers 
(thereby assuming the center of a cluster is located at a 
galaxy position). We use our red-sequence model to cal- 
culate a ph otometric redshift for each galaxy (zj-od! see 
Section 17. and evaluate the goodness of fit of our red 



galaxy template. Galaxies that are not a reasonable fit 
to the red galaxy template at any redshift are not consid- 
ered as possible central galaxies for the purpose of cluster 
ranking. We note that as long as a cluster has at least 
one galaxy that is a reasonable fit to the template, that 
cluster will be considered in the first step of the cluster- 
finding stage. We then use the z^ed value of the candidate 
central as an initial guess for the cluster redshift. Based 
on this redshift, we compute the cluster richness A and 
its corresponding likelihood using a multi-color general- 



ization of the method of R12 (see Section [4]). When a 
significant number of red-sequence galaxies (> 3) is de- 
tected within a 500 kpc aperture, we re-estimate the 
cluster redshift by performing a simultaneous fit of all 
the high probability cluster members to the red sequence 
model. This procedure is iterated until convergence is 
achieved between member sel ectio n and cluster photo- 
metric redshift (z>; see Section 7.2). The resulting list of 
candidate cluster centers is then rank-ordered according 
to likelihood. Starting with the highest ranked cluster 
we measure the richness and membership probabilities. 
These probabilities are then used to mask out member 
galaxies for lower-ranked clusters in a process we term 
"percolation" (see Section 9.3). In this way we prevent 
double-counting of galaxy clusters. 

4. RICHNESS ESTIMATOR A 

The redMaPPer richness estimator. A, is a multi-color 
extension of the richness estimator developed in R09 and 
R12, which we now denote Acoi to indicate that it is a 
single-color richness. Here we review how we calculate A 
and highlight the differences relative to R12. 

Let a; be a vector describing the observable properties 
of a galaxy (e.g., multiple galaxy colors, i-band magni- 
tude, and position). We model the projected distribution 
around clusters as a sum S{x) = Xu{x\X) + b{x) where 
A is the number of cluster galaxies, u(a;|A) is the density 
profile of the cluster normalized to unity, and b{x) is the 
density of background (i.e., non- member) galaxies. The 
probability that a galaxy found near a cluster is actually 
a cluster member is simply 



Pn 



p{x) = 



A'u(a;|A) 



\u{x\\) + b{x) ' 



(1) 



We note that in Section [93} the definition of the mem- 
bership probability will be modified to allow for proper 
percolation of the cluster finder. This modification will 
only impact clusters that are close to each other along 
the line of sight and at comparable redshifts. Regardless, 
the total number of cluster galaxies A must satisfy the 
constraint equation 



\ = Y^p{x\\)= 



Au(a;|A) 



R<RaW 



Au(a;|A) + b{x) 



(2) 



The corresponding statistical uncertainty in A is given by 



Var(A)=^p(a;|A) [1 - p{x\X)] . 



(3) 



In principle, these sums should extend over all galaxies. 
In practice, one needs to define a cutoff radius i?e an d a 
luminosity cut Lcut- In R12 and Rozo et al. (2011) we 
showed that the scatter in the mass-richness relation is 
expected to be minimized when Lcut = 0.2L*, while the 
optical radial cut scales with richness via 



Rc(X) = i?o(A/100.0)'^. 



(4) 



where Rq ~ 1.0/i^^Mpc and f3 — 0.2. We adopt these 
parameters in redMaPPer. 

To determine the cluster richness of a galaxy cluster, 
note that A is the only unknown in Eqns.[2]and|4] There- 
fore, we can numerically solve Eqn.[2]for A using a zero- 
finding algorithm. The solution to Eqn. |2] defines A, 
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and naturally produces a cluster radius estimate via 
Eqn. [4] We emphasize this cluster radius is not a proxy 
for any sort of standard overdensity radius such as R500C 

or i?200m- 

As in R12, we consider three observable properties 
of galaxies for our filter function u{x): R, the pro- 
jected distance from the cluster center; m^, the galaxy 
i-band magnitude, and a color variable. Ideally, our 
color variable would be the full color vector (e.g., c = 
{u — g,g — r,r — i,i — z} in the case of SDSS data). How- 
ever, practical considerations forced us to reduce this in- 
formation to a single value which gives the goodness 
of fit of our red-sequence template. In doing so, we effec- 
tively compress the information contained in the multi- 
dimensional color vector into a single number that mea- 
sures the "distance" in color space between the galaxy of 
interest and our red sequence model. This is described 
in more detail below. 

We adopt a separable filter function 



uix) = [2^i?S(i?)]0(m,)p,(x' 



(5) 



where is the two dimensional cluster galaxy density 
profile, (j)(m) is the cluster luminosity function (expressed 
in apparent magnitudes), and Pi/(x^) is the distribu- 
tion with h' degrees of freedom. The pre-factor 27ri? in 
front of S(i?) accounts for the fact that given S(i?), the 
radial probability density distribution is 2ttR'S{R). We 
summarize below the filters used in redMaPPer. 

4.1. The Filter 

Assume we have a multicolor red sequence model for 
which we have (c|z,mi), the mean color of the red se- 
quence galaxies for any given redshift z and i-band mag- 
nitude mj. Furthermore, assume we have a correspond- 
ing covariance matrix Cint(-z) to describe the intrinsic 
scatter and correlations of galaxy colors about the mean. 

When comparing a given galaxy with color vector c to 
the model color, we can define 

X^{z) = (C - (c|z,mi)) (Cint(2) + Ccrr)"^ (c - (c|z,r71i)) 

where c is the color vector of the galaxy under consider- 
ation, rrii is the observed galaxy magnitude, {c\z,mi) is 
the model color, and Cint(z) is the corresponding covari- 
ance matrix, which itself depends on redshift. The ma- 
trix Corr describes the photometric error of the galaxy 
under consideration. 

For red sequence cluster members, will be dis- 
tributed according to the x^ distribution with v degrees 
of freedom. 



2U''/2-l)„-xV2 



2'^/2r(zy/2) 



(7) 



where v is the number of colors employed when estimat- 
ing x^- Note that for ^ = 1 the x^ filter does not reduce 
to the single color filter of R12. This is because our 
distance measurement does not distinguish between 
galaxies that are too red from galaxies that are too blue, 
so there is some loss of information when moving from 
color-space to x^- While a full i/-dimensional Gaussian 
color filter would work better than our x^ filter — and 
would exactly reduce to the single color Acoi from R12 
when V = 1 — the problem of background estimation for 



such a filter is much more difficult. In particular, in the 
case of DR8, it requires one to estimate the galaxy den- 
sity in a five dimensional space: {mi^u—g,g—r,r—i,i — z}. 
We found these background estimates to be very noisy, so 
we compressed the color information to a single variable 
X^. In this way, at any given redshift the background 
depends only on and x^- 

4.2. The Radial and Luminosity Filters 

For the radial filter, we f ollow R09 and R12 an d adopt 

which 



1995) 



a projected NFW profile ( Navarro et al 

is a good description of the dark matter profile in N 
body simulations. In addition, it has been found to be 
a good description of the radial distribution of cluster 
galaxi es ( Lin fc M.ohr| 2004 Hansen et al.||20d5 Popesso 



et al. 



2007). In R12 it was shown that in order to mmi 



mize the scatter in the mass-richness relation the NFW 
filter works as well or better than other possible radial 
profiles. Therefore, we refer readers to Section 3.1 of R12 
for details on the form of the radial filter. 

For the luminosity filter, we similarly f ollow R09 and 
R12 and adopt a Schechter function (e.g., Hansen et al. 
20091), written as 



(^(m,) cx io-0-4(™.-™.)("+i) exp (^_io-"-4(™.-™.)^ . 

(8) 

In an update from R12, we have set a — 1.0 indepen- 
dent of redshift, which provides a better description of 
the data. The characteristic magnitude, m* , is the same 
as used in R12, calculated for a k-corrected pass ively 



evolving stellar population (Koester et al. 2007b |). In 



the redshift range 0.05 < z < 0.7, appropriate tor DR8, 
m*(z) is well approximated {S < 0.02 mag) by the fol- 
lowing polynomials: 



m^,{z) 



' 22 AA + 3.36 ln(z) + 0.273 \n{zf if z < 0.5, 
-0.0618 ln(z)3 - 0.0227 ln(z)4 
22.94 + 3.08 ln(z) - 11.22 \u{zf if z > 0.5. 
-27.11 ln(z)3 - 18.02 ln(z)4 



For each cluster, m* is taken at the appropriate redshift 
and the luminosity filter is normalized to unity at the ap- 
propriate magnitude cutoff. As with R12, this is taken to 
be 0.2L* , or m» -I- 1.75 mag. Although in the current ver- 
sion of redMaPPer both a and are fixed as described 
above, in future releases we will replace these parame- 
ters with those directly measured from calibration clus- 
ters. We emphasize, however, that modest changes to 
the shape of the luminosity filter result in insignificant 
changes to the recovered richness. Of course, changes 
to the magnitude limit above which one counts galaxies 
has an obvious systematic impact on the richness as one 
moves up and down the luminosity function, although we 
have found these modest shifts no not signifcantly impact 
the mass-richness scatter (see R12). 

4.3. Background Estimation 

As in R12, we assume that the background density 
is uniform, such that b{x\z) = 27: RYig{mi,x^\z) where 
Sg(mi,x^|z) is the galaxy density as a function of galaxy 
i-band magnitude and x^j where x^ is evaluated using 
the red sequence model at redshift z. In this way, the 
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effective background for every cluster is different and de- 
pends on the cluster redshift. 

To calculate the mean galaxy density, we first calculate 
the value for all galaxies in a grid of redshifts with 
spacing 0.02. For computation purposes, we only calcu- 
late the for galaxies that are brighter than O.IL* at a 
given redshift bin. At each redshift we bin the full galaxy 
catalog in an d magnitude using a cloud-in -cells (CIC) 
algorithm (e.g., Hockney &: Eastwood||198i] ), and divide 



by the survey area. For our cells, we use x'^ S [0,20] with 
a bin size of 0.5, and i € [12,miini] with a bin size of 0.2. 
The < 20 cut can be justified by the fact that the sum 
total of cluster membership probability in the redMaP- 
Per catalog for galaxies with x^ & [15,20] is only 0.7%. 
That is, our x^ < 20 cut impacts our results at well 
below the 1% level. The final galaxy number density is 
normalized by the width of each color and magnitude bin. 
To evaluate the background at an arbitrary redshift, we 
linearly interpolate between the backgrounds computed 
along our redshift grid. As noted in R12, because the 
background is measured per square degree, the average 
number of background galaxies as a function of x^, mag- 
nitude, and redshift is automatically accounted for as the 
angular size of the clusters changes with redshift. 

5. HANDLING MASKED REGIONS AND LIMITED 

DEPTH 

In an ideal world, our survey would have uniform 
depth, be deep enough to reach 0.2L^ at all redshifts of 
interest, and there would be no missing and/or masked 
regions, e.g., due to bright stars. Most previous optical 
cluster finders make this simple assumptiorp^ Here, we 
describe how we can properly correct for tEese effects. 
Our approach is conceptually straightforward. Given a 
cluster model and a geometric and magnitude mask, we 
can effectively calculate the fraction of cluster galaxies 
that we expect to be masked out. This correction factor 
is then applied to the "raw" richness to compensate for 
the masked region. In practice, this correction can be 
self-consistently incorporated into the richness estima- 
tion, as described below. Our method is simple to im- 
plement with any geometric mask, including those that 
describe variations in depth. However, we do not take 
into account masks that contain one or more missing 
bands. 



5.1. The Correction Term 
Looking back at Eqn. [2] we have: 

u{Xi) 



Xu{xi) + b{xi) ' 



(9) 



where Xi describes the radial position, color (via x^)i 
and luminosity (via rrii) of each galaxy. This formula- 
tion works if we can see all galaxies, but in reality we 
cannot. Let us then pixelize all observable space x into 
infinitesimal pixels, and let Ni be the number of galax- 
ies in pixel i. Most pixels have Ni = 0, but a few have 
Ni = 1. Thus, the sum over all galaxies can be re- written 



13 An exception is 3DMF l |Milkeraitis et al.|2010l l , for which they 
calculate the fractional area masked tor eacn cluster 



in terms of a sum over all pixels via: 

u{x,) 



1 



N 



\u{xi) + b{xi) ' 



(10) 



In the case of masking, we can only observe the galaxies 
that are inside the mask, so we can split this sum into: 



u{Xi) 



Xu{xi) + h{xi) 



-E 



N, 



u{Xi) 



\u{xi) + b{xi) 



(11) 



The "in" term is the raw A that we usually compute, 
and can be replaced by the standard sum over all ob- 
served galaxies. The "out" term is now a correction to 
the standard expression, call it C, 



c = J2n, 



u(x.i) 



' \u{xi) + b{xi) 

In terms of C, Eqn. [2] can be rewritten as 

u{x) 
\u{x) + b{x) 



^-E 



(12) 



(13) 



gals 



Now, while C is unknown (we cannot see the masked 
region), we can compute its expected value for a cluster 
of richness A. Using the fact that 

{N,) = [\u{x,) + b{x,)]^x,, (14) 

we see that the expectation value of C is given by 



(C|A) = dx u{x). 

J out 



(15) 



In the above equation, we have made explicit the fact 
that C depends on A, both via the cutoff radius used 
in the sum over galaxies, and because the radial filter 
depends on A. Thus, in the presence of missing data, our 
richness estimate is given by the solution to 



i-(qA) = ^ 



u{x) 



\u{x) + b{x) 



(16) 



Note, however, that because C is a unknown, there 
must also be additional measurement error associated 
with this unknown correction. To calculate the vari- 
ance of C, we note that Var(7Vi) — (Ni). To com- 
pute Cov{N„Nj) = {N,Nj) - (Ni) (Nj), we first com- 
pute (NiNj). For infinitesimal pixels, NiNj = 1 implies 
that either both pixels i and j contain cluster galaxies, or 
that one pixel contains a cluster galaxy while the other 
pixel contains a background galaxy, or that both pixels 
contain a background galaxy. Consequently, 

P{N,Nj = 1) = Ax'^[X{\-l)uiUj + X{uibj+Ujbi)+b,bj]. 

(17) 

P{N,Nj ^ 1), subtracting off {N,) (Nj) 



Since {N,Nj) 
we arrive at 



Cov{Ni,Nj) ^ -Ax'^UiUj 
for i j. Putting it all together, we arrive at 

Var(C) = (^'^) " E E Gov{N,,Nj)p,p, 

i i j^i 



dx u{x)p{x) 



dx u(x)p{x) 



(18) 
(19) 
'(20) 
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To propagate the error in C into the error in A, we set 



dX 
dC 



(21) 



A((C» 



where ac — [Var(C)]^/^. The derivative of A with re- 
spect to C is evaluated numerically about the expecta- 



tion value of (C|A), where A is the solution to Eqn. 16 

For future reference, it will be useful to define the 
"scale factor" 

for the case in which the only source of masking is due to 
limited depth. With this definition, a cluster with rich- 
ness A has a total of A/S'(z) galaxies above the limiting 
magnitude of the survey. 

In addition, it is useful to calculate the fraction of the 
effective cluster area that is masked solely by geometrical 
factors such as bright stars, bad fields, and survey edges. 
This is complementary to S(z) defined above in that it 
contains all the masking except the magnitude limit. The 
cluster mask fraction is then 



mask 



/out "(^) 

/ dx u{x) 



(23) 



This quantity is very useful because clusters that are 
strongly affected by edges are more likely to be catas- 
trophically miscentered, and to have poor richness esti- 
mates. Consequently, when defining our cluster catalog 
we will apply a cut in /,„ask- 

5.2. Evaluating the Mask Correction 

Evaluation of the mask correction and its associated 
error on A can be difficult. Fortunately, this problem is 
well suited to Monte Carlo integration. First, define the 
selection function S{x), so that S{x) = 1 if the galaxy 
is in a region where it is detected, and S{x) = if the 
position X is masked out. Then, for any function /(x). 



dx f{x) 



[1-S{x)]f{x)dx, 



(24) 



where the integral on the right hand side is over the full 
cluster region, i.e., R G [0,i?(A)], L > L^^^, and over all 
colors. 

Applying this to Eqn. [15] wc find 



(C) = 



cluster 



[1 — S{x)]u{x)dx. 



(25) 



Since u{x) is the probability distribution for x, we can 
evaluate the above integral using Monte Carlo integra- 
tion by randomly sampling N sets of model galaxy pa- 
rameters from the u{x) filter function, and then comput- 
ing the sample mean of the function 1 — S{x). That is, 
(C) is simply the fraction of random draws that fall in the 
masked region. Similarly, we can evaluate the integrals 
defining Var(C) via 



Var(C)^^ip^p(a=,) 



XN ^ 

out 



(26) 



where N is again the total number of random draws. 
For simplicity, we have replaced the 1 — S{x) terms with 



a summation over all galaxies that are outside the de- 
tectable region due to the mask, as these are the only 
galaxies that contribute to the summand. 

One of the slowest part of this process is drawing ran- 
dom realizations of Xi. However, these random draws 
do not need to be independent from cluster to cluster. 
In practice, we generate a template distribution of 5000 
galaxies, and scale the radius and magnitude to the ap- 
propriate values for each galaxy cluster. We find that 
this number of galaxies gives accurate results for the re- 
covered richnesses and richness errors, except for galaxy 
clusters that are largely masked out. Consequently, our 
final cluster selection criteria includes the requirement 
that the cluster mask fraction, /mask, is less than 20%. 
In Appendix|F]we demonstrate with DR8 data that given 
our filter function this formalism accurately corrects for 
masked regions and limited depth. 

6. CALIBRATION OF THE RED SEQUENCE 
6.1. Outline 

Suppose we have a complete sample of red galaxies 
with spectroscopic redshifts down to the limiting mag- 
nitude of the survey. One can then directly fit a red- 
sequence model to these galaxies to calibrate the color 
as a function of redshift for these galaxies. The question 
then becomes: how does one get a sample of red spectro- 
scopic galaxies? Note that in order to calibrate the tilt 
of the red sequence, it is important to include a signif- 
icant number of less luminous galaxies, which are more 
difficult to come by. 

Our solution is to simply use the cluster members 
themselves. If we know the spectroscopic redshift of a 
cluster, then all the cluster members can be tagged with 
the spectroscopic redshift of their central galaxy, lever- 
aging one spectroscopic redshift into many. Of course, 
from photometric data one can only identify likely clus- 
ter members, so the fit of the red-sequence model must 
account for contamination by non-cluster members. 

In order to fit the red-sequence model, all that is re- 
quired is a sample of galaxy clusters with known spec- 
troscopic redshifts. As discussed in Section [3] we only 
require a limited set of these training clusters. These 
training clusters can be derived from external X-ray and 
SZ catalogs, or from spectroscopic follow-up of likely cen- 
trals in dense regions by running redMaPPer with an 
ad-hoc red-sequence model. 

In the specific case of SDSS DR8, we construct this 
training cluster sample based on existing spectroscopic 
galaxy samples. Each spectroscopic galaxy is used as 
a "seed" around which we look for galaxy clusters by 
identifying nearby overdensities of galaxies with the same 
color as the seed galaxy. The details of these steps are 
described below. We emphasize that our calibration is 
performed using only 2,000 deg^ of SDSS data. As we 
show in Appendix O we show that while we have the 
full wealth of SDSSspectroscopic data at our disposal, 
an equivalent red-sequence model may be derived from 
only 400 central galaxies from z € [0.05,0.6]. 

6.2. Selecting Seed Galaxies and the Initial Color 
Model 

The initial calibration of the red sequence relies on 
spectroscopic "seed" galaxies. This may simply be a set 
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of training clusters with spectroscopy (see Appendix O), 
or in the case of DR8, a broad spectroscopic sample tnat 
contains a sufficient number of red galaxies in galaxy 
clusters. For SDSS spectroscopy, the first step is to iden- 
tify the subsample of spectroscopic galaxies that are red. 
This is achieved by using a single color that samples the 
4000A break for early type galaxies. With SDSS data, we 
use g — r for z < 0.35, and r — i for z > 0.35. Because we 
wish to have a relatively clean selection of red-sequence 
galaxies as our seeds, we approach the problem of select- 
ing these galaxies in several steps. We emphasize that 
some of these steps are only necessary for cutting the 
full list of SDSS spectra to an appropriate red galaxy 
sample. 

Step 1: Perform an approximate red galaxy selection. 
To make this selection, we bin the galaxies in redshift 
bins of width ±0.025. We then use the error-corrected 



( jHac 
Ith (Ti, 



2009 ) to estimate 

the mean and intrinsic width (Tintl^) of the red-sequence 
galaxies. Those galaxies within 2a of the expected mean 
for red galaxies are considered approximately red. 

Step 2: Use this approximate red galaxy selection to 
measure the mean color of the red galaxies as a function 
of redshift. Given our approximate red galaxy sample, 
we refine our initial estimate of the mean color-redshift 
relation by minimizing the function 



c{zi)\ 



(27) 



where c(z) is our model for the color as a function of 
redshift. The function c(z) is defined via spline interpo- 
lation, and the value of the spline nodes are the param- 
eters with respect to which s is minimized. The spline 
nodes are placed on a redshift grid with spacing of 0.1. 
Our use of c(z) indicates that this is our early calibration 
model, which is distinct from the full model color (c|z) 
that is derived at the end of the calibration procedure. 
In defining the function s, we rely on the sum of abso- 
lute values rather than the sum of the squares to make 
the resulting minimization more robust to gross outliers. 
The functi on s is minimized usin g the downhill-simplex 
method of Nelder & Mead ( 1965 ) as implemented in the 
IDL AMOEBA function. 

Step 3: Use the mean color-redshift relation to esti- 
mate the width of the color-redshift relation. We can 
now improve upon our initial estimate of the width of 
the color-redshift relation by minimizing the function 



E 



\\c,-Z{zi)\-MAD\ 



(28) 



where aint{z) = 1.4826 x MAD, where MAD is the me- 
dian absolute deviation of the sample about the median, 
and the factor of 1.4826 relates the MAD to the stan- 
dard deviation for a Gaussian distribution. The value 
MAD is again defined via spline interpolation, with the 
free parameters being the values of the function at the 
nodes. 

Step 4- Generate a final sample of seed galaxies. Fi- 
nally, with the full red spectroscopic galaxy model in 
hand (c(z), CTint(z)), we can cleanly select our seed sam- 
ple. We select all galaxies within 2aintiz) of the model 
color at the spectroscopic redshift of the galaxy, using 
g — r dX z < 0.35 and r — i at z > 0.35. 




0.2 0.3 0.4 0.5 0.6 



Figure 1. Sample of red spectroscopic galaxies selected for train- 
ing in 2000 deg^ of DR8. Top panel shows g — r color and bottom 
panel r — i color. The red galaxy selection is done in g — r {r — i) 
at 2 < 0.35 (z > 0.35), selecting all galaxies within 2a-int{z) of the 
spectroscopic redshift of the galaxy. Note that this selection leaves 
a small number of outliers in the complementary color. 



In Figure[T]we show the final seed spectroscopic galaxy 
selection for the g — r and r — i colors. The large red 
points show the median colors at the node positions, and 
the dashed red lines show the cubic spline interpolation. 
Note that the single-color selection leaves a small number 
of outliers in the complementary color. In addition to the 
seed galaxies, we will make use of our red spectroscopic 
galaxy color model in the following section. 

6.3. Single Color Member Selection 

Having selected our seed galaxies and calibrated a 
rough initial color-redshift relation, we now proceed to 
find likely cluster members around each of our seed galax- 
ies. For this first iteration we rely on single-color based 
membership. Specifically, in R12 we demonstrated that 
for moderately rich (A > 20) clusters, one can reliably es- 
timate the red sequence directly from the data as follows. 
First, we select all galaxies within a color window around 
the seed galaxy. Next, we fit for the amplitude and tilt 
of the red-sequence of that galaxy cluster directly from 
the galaxy data. However, in extending this algorithm 
to high redshift, we found that large photometric errors 
can introduce an unacceptable amount of noise in the 
initial color-box selection of galaxies. Therefore, rather 
than drawing a color box around the color of the central 
galaxy for the initial fit, we draw the color box around 
the model color c{z) calibrated in the previous section. 
In detail, we: 

1 . Take a red galaxy of known spectroscopic redshift 
(the "seed"). 

2. Select all galaxies within 500/i~^kpc of the spec- 
troscopic galaxy, as well a s 2a of the model color 
determined in Section |6.2[ For the model color, we 
use g — r at z < 0.35 and r — i at z > 0.35. The 
width a of the color box is set to 0.05 and 0.03 re- 
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Figure 2. Magnitude distribution of the red spectroscopic seed 
galaxies (black histogram) and photometrically selected cluster 
galaxies (red dashed histogram) in the redshift slice z £ [0.24,0.26]. 
The "cluster member" histogram weights each galaxy by its mem- 
bership probability. The gain in the training sample of faint red- 
sequence galaxies through our photometric selection is of critical 
importance for an accurate calibration of the red-sequence as a 
function of redshift, particularly at the faint end. 



spectively, which we expect to be the approximate 
red sequence width (e.g., R12). 

3. Fit the red sequence (slope and intercept) of these 
galaxies. 

4. Measure the single-color Acoi using the method of 
R12 and a fixed aperture of 500 hT^ kpc. 

5. For all overdensities with Acoi > 10, take the galax- 
ies with pincm > 0.3 and assign them the spectro- 
scopic redshift of the initial seed galaxy. 

At this point, we have leveraged the spectroscopic seed 
galaxies to generate a set of red galaxies as faint as 0.2L* 
over the redshift range of interest. Although not all of 
these galaxies are true cluster members, we have an es- 
timate Pmem of the probability that each such galaxy 
is indeed a red sequence cluster member, as in Eqn. [ij 
Consequently, we can model the contamination of non- 
red-sequence galaxies in our sample, as shown below. 

We emphasize that it is essential that we leverage our 
spectroscopic redshifts to fainter magnitudes to properly 
model the red sequence. In the case of DR8, our ini- 
tial seed galaxy sample is comprised of 42,000 galax- 
ies associated with A > 5 clusters, almost all of which 
are preferentially bright . B y contrast, our final calibra- 



tion sample (see Section 6.5 1 is comprised of over 600,000 
red-sequence galaxies that extend to much fainter mag- 
nitudes. This is illustrated in Figure [2] The magni- 
tude distribution of our seed galaxies in the redshift 
slice z S [0.24,0.26] (solid black histogram) is contrasted 
with the membership- weighted magnitude distribution of 
our final photometrically selected training sample (red 
dashed histogram). We see that the gain in the effective 
number of red sequence training galaxies is enormous, 
allowing for an accurate calibration of the red-sequence 
(amplitude, tilt, and scatter) as a function of redshift. 
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Figure 3. Composite red sequence at 0.25 < z < 0.26 for color- 
selected galaxies with pmcm > 0.9. A linear model (red dashed line) 
with roughly constant intrinsic scatter is a good representation of 
the red sequence in both g — r and r — i. 



We note that we have explicitly verified that our likeli- 
hood treatment of this color cut results in unbiased esti- 
mates of the total scatter. 

6.4. Modeling The Red Sequence 

Given a list of galaxies with multidimensional color 
(c), redshift (z, taken to be the cluster redshift), and 
membership probability (pmom), we can now proceed to 
calibrate the full red sequence model. Our model is well 
motivated by observations of galaxy clusters, in that the 
red sequence at any given redshift in a given color can 
be described by a simple linear relation between color 
and i-band magnitude rrii with intrinsic scatter dint . For 
example. Figure |3] shows the composite red sequence at 
2 = 0.25 for both g — r and r — i colors, for all galaxies 
selected in the final calibration iteration with Pmcm > 0.9. 

Our red-sequence model we is defined in terms of 
smoothly evolving functions of redshift characterizing the 
amplitude and slope of the mean color-redshift relation, 
and the corresponding covariance matrices. We have 
opted to use a cubic spline interpolation to parameterize 
these functions. Given the large number of colors (four 
for SDSS), and broad redshift range, our model neces- 
sarily contains a large number of free parameters. For 
instance, in our SDSS DR8 implementation, we required 
a total of 118 parameters to fully characterize the red- 
sequence model. In principle, we would like to fit the full 
red sequence model simultaneously. However, to make 
the problem more tractable we fit the red sequence pa- 
rameters governing the mean relation and the diagonal 
elements of the covariance matrix one color at a time. 
Once these terms are in place, we fit the off-diagonal 
terms of the covariance matrix. We are also cautious 
that our model does not have too many free parameters 
given the training data such that over-fitting becomes 
possible. As shown in Appendix [C] this can be a prob- 
lem in the case of very sparse training data. 

An additional complication comes from the fact that 
our selection of red sequence galaxies is not entirely clean. 
Our fit of the red sequence must take into account the 
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background density of non-member galaxies, as described 
below. In addition, we also have to contend with blue 
cluster galaxies that are not taken into account by a 
global background term. These blue galaxies will tend 
to have two effects. First, as the blue fraction increases 
at lower luminosities, they will tend to steepen the ap- 
parent red sequence tilt. Second, the blend of red and 
blue galaxies will tend to broaden the apparent intrinsic 
width of the red sequence. 

In order to deal with both of these effects of blue cluster 
galaxies, we have taken a pragmatic approach. When 
fitting the red sequence for a given color, we first perform 
a sharp color cut to concentrate on the core of the red 
galaxy distribution. Naively, this cut would introduce 
biases in the recovered red-sequence model, leading to 
under-estimates of the scatter. We avoid this difficulty 
by explicitly modeling such a color cut into our likelihood 
function. All that remains is to specify the color cut. 
Here, we apply a color cut of 1.5a about the median 
color of the high probability member galaxies, where a 
is the median absolute deviation of the color about the 
median. 

6.4.1. Measuring the Model Mean and Color Scatter 

As noted above, we begin by measuring the model color 
{c\mi,z) as a function of galaxy magnitude and cluster 
redshift z for each color, one at a time. The first step 
in this process is to define the pivot point mi{z) used to 
calibrate the amplitude and tilt of the mean red sequence 
relation at redshift z. We write 



{c\z,mi) = c{z) + s{z)[mi - mi(z)]. 



(29) 



We wish to select a pivot point that is characteristic 
of most cluster members. To do so, starting from our 
Pmem > 0.3 members list, we apply a Pmcm > 0.7 cut. 
Using this sub-sample, we minimize the cost function E 
where 



E ^ 



m^{z)\ 



(30) 



where rni{z) is defined via spline interpolation, and the 
model parameters are the value of rni{z) at the nodes. 

Having defined our pivot point as a function of red- 
shift, we turn to calibrating the amplitude and slop e of 
the mean relation, i.e., c{z) and s{z) in Eqn. [29l As 



a first step, we do a rough estimate of the amplitude 
and scatter, which we will use to isolate the core of the 
color distribution of member galaxies. These rough es- 
timates for the amplitude and scatter are denoted 'c{z) 
and ct(z), and are obtained by selecting galaxies with 
Pmcm > 0.7, an d th en fitting for these functions as was 
done in Section [6.2| Specifically, the functions are spline 
interpolated, with model parameters being the value of 
these functions at the nodes. The best fit parameters 
are found by mini mizin g Eqn. 27 and ct(z) is defined 
by minimizing Eqn. 28 The primary difference between 
these new color estimates and scatter relative to those 
derived in Section [6^ is that these parameters are now 
appropriate to the full red sequence rather than simply 
the (brightest) spectroscopic galaxies. 

We now turn to measuring the actual model parame- 
ters defining the amplitude c{z), slope s(z), and scatter 
C-f*(2;). As before, we use a cubic spline interpolation to 
parameterize these smoothly evolving functions of red- 
shift. For DR8, we have chosen to use a node spacing of 



0.05 for c(z), 0.1 for s(z), and 0.15 for Cf^{z). We have 
found that a relatively tight spacing is required for c(z), 
as this function can change relatively rapidly at filter 
transitions. Fortunately, c(z) is the most robust param- 
eter, and thus is amenable to smaller node spacings. The 
slope and scatter are not expected to vary as rapidly, and 
are also noisier to estimate, so we have chosen wider node 
spacings. Overall, the calibration is not very sensitive to 
the node spacings chosen provided there are sufficient 
calibration galaxies (though see Appendix [C]) . 

Starting from the photometrically selected galaxy 
training set from the previous section, we first apply a 
color cut |c — c(z)| < 1.5a(z), which ensures that the 
red-sequence parameters are based on the core of the red 
galaxy distribution, and are therefore less likely to be bi- 
ased by blue galaxies. In our model, the probability that 
a red-sequence cluster galaxy has a color c is given by a 
truncated Gaussian distribution, 



G(c) = 



/2irCT 



-(c-(c|2,mi))V2cr^ 



erf 



\ V2a ) 



(31) 



where the expectation value (c|mi,z) is defined i n t erms 
of our model functions c(z) and s{z) as per Eqn. |29| and 



the scatter a is the sum in quadrature of the intrinsic 
scatter and the photometric error of the galaxy. 



(32) 



C"j* is the intrinsic scatter of the red 



where crint(z) = 

sequence. The 'erf term in the denominator accounts for 
the fact that G(c) is truncated at c{z) ±cr(z), under the 
approximation c{z) = c{z). This approximation is only 
used in the overall normalization of the distribution. 

The total probability distribution for all of our calibra- 
tion galaxies must account for the fact that some of our 
galaxies are in fact background galaxies, so the full color 
distribution is given by 

P(c) = PmcmG(c) + (1 - Pmcm)fe(c,r71i), (33) 

where b{c,mi) is the distribution in color and magnitude 
of galaxies about random points. The shape of the back- 
ground function is obtained by binning all galaxies in 
color and magnitude bins and using a CIC algorithm as 
in Section 

In the end, our task is to calculate the set of c(z), s(z), 
and CJf{z) values at the given cubic spline nodes that 
maximizes the total likelihood given by 



ln£ = ^InP,. 



(34) 



As above, we accomplish this maximization by making 
use of the downhill-simplex method. The maximum like- 
lihood point defines the model functions c(z), s(z), and 
G'"*(z). We emphasize that the likelihood is explicitly 
truncated as the data is, so that the recovered scatter is 
unbiased relative to the full population of cluster mem- 
ber galaxies, as we have confirmed with simple mock red 
sequences and blue clouds. 

In Figure [4] we show the color evolution of red sequence 
galaxies with Pmem > 0.9 for the g — r and r — i colors 
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Figure 4. Color as a function of redshift for the sample of red- 
sequence galaxies with pmom > 0.9. The red points indicate the 
c(z) values at the spline node positions, and the long-dashed lines 
are the spline interpolation. The short-dashed lines indicate the 
So-jnt range. Note that the colors in the figure are not corrected 
for red sequence tilt. We caution that the intrinsic width of the 
red sequence can be wider than the pmom > 0.9 population of 
galaxies suggests, since high probability membership requires the 
galaxy to fall close to the expected average color. Conversely, the 
larger number of outliers in g — r above reflects the fact that the 
photometric errors in <; — r at high redshift are larger than the 
intrinsic width of the red sequence. 



in DR8. The red points indicate the c{z) values at the 
sphne node positions, and the long-dashed lines are the 
smooth interpolation. The short-dashed lines indicate 
the SfJint range. Note that the colors in the figure are 
not corrected for red sequence tilt. We caution that the 
intrinsic width of the red sequence can be wider than 
naively indicated by the Pmcm > 0.9 galaxies, since high 
probability galaxies must reside closer to the average red- 
sequence model. 

6.4.2. Measuring Cfk{z) 

With the intercept and slope of the red sequence in 
hand, as well as the diagonal elements of the covariance 
matrix, we now estimate the off-diagonal elements of the 
covariance matrix, Cj^*(z). Once again, we use a cubic 
spline interpolation, with the same 0.15 node spacing as 
used for C]f (z). 

In order to make the calculation tractable, to constrain 
the off-diagonal elements of the covariance matrix we 
consider the problem piecewise, tackling two colors at 
a time. Each individual piece of the covariance matrix 
constrained in this way will be positive-definite and thus 
a valid covariance matrix. Unfortunately, due to noise in 
the estimation of the parameters, this method does not 
guarantee that the total covariance matrix, Cint(z), will 
also be positive-definite. 

To ensure that Ci-o_t{z) is positive-definite, we constrain 
the parameters for pairs of colors in a specific priority or- 
der, ensuring that the best constrained colors have prece- 
dence. In the case of DR8 data in the redshift range 
z e [0.05,0.6], these are g — r and r — i. Then, at each step 
in the downhill-simplex estimation described below we do 
not allow any terms in Cj^'(z) that result in a minimum 
eigenvalue in the total covariance matrix Cint(-2) that is 



less than 0.01^. In this way, the first color pair to be 
constrained {g — r,r — i) is essentially free, while the final 
(and noisiest) color pair to be constrained (u—g^i — z) will 
not result in a non-invertable covariance matrix Cint(-z)- 
To perform the pairwise constraints on the off-diagonal 
elements, let us consider the residuals in two colors Xj 
and Xk Eqn. [29] 

X = c — {c\mi,z) — (c(z) + s{z)[mi — m^(z)]). (35) 

The probability distribution function is again a Gaussian, 
though this time we explicitly leave the covariance matrix 
in the equation: 



G(x) 



1 



/2^|C|i/2 



exp 



(36) 



where x = {xj,Xk} is the vector of residuals, and the 
total covariance matrix C is 



Ci„t(2) +Cc„.(z). 



(37) 



Here Cint(^) and Ccn{z) are the covariance matrices 
characterizing the intrinsic scatter and photometric error 
respectively. The intrinsic scatter is simply 



Cint — 



2 

^int,j 



^0'int,jO'int,fc 



(38) 



where aj and Ufe are known from the previous section, 
and r is the only unknown. The covariance matrix 
Coir (2) is derived from the photometric error in each 
band. Given two colors Cj — ma — mp and Ck = rriy — ms, 
the covariance matrix characterizing the photometric er- 
ror is given by 



Cerr(^) 



and 



■n 







ff7 = /3 
otherwise. 



(39) 



(40) 



Here, we are assuming that neighboring colors are of the 
form Cap and c^a, i.e., that the "shared" magnitude is 
mp — m-y. The covariance between photometric errors 
arises precisely because, for example, the neighboring col- 
ors g — r and r — i are both derived from the same r-band 
magnitude. 

The color distribution function of the full galaxy pop- 
ulation is again given by Eqn. |33[ noting that now the 
background term b{cj,Ck,mi) is given by a three dimen- 
sional binning in two colors and i-band magnitude. In 
addition, we implement a prior on r with mean and 
width 0.45 for each of the nodes. We find that this prior 
reduces the noise in the parameter constraints, which 
is especially important at high redshift where the pho- 
tometric errors dominate and the covariance matrix is 
largely unconstrained. At the same time, this prior al- 
lows high correlations (r ^ 0.9) if strongly favored by the 
data. Our total likelihood is now given by: 



(r„/0.45)2 



(41) 



where is a sum over all the nodes, and r„ is the cor- 
relation coefficient at that node. That is, the prior is 
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placed at each of the nodes. Maximization of the hkeh- 
hood function defines the final values for the correlation 
coefficients that characterize the intrinsic scatter covari- 
ance matrix. 

6.5. Iterating The Red-Sequence Model 

We emphasize that the estimation of the red sequence 
parameters in the previous section depends on the mem- 
bership probabilities (pmcm) of the red sequence galaxies. 
Of course, the membership probabilities themselves de- 
pend on the red sequence model. In order to obtain a red 
sequence model that is consistent with the membership 
probabilities, we take an iterative approach. 

After we calibrate the red sequence parameters based 
on single color membership probabilities, we run the clus- 
ter finder on the training data, as described in Section [9j 
During these calibration runs we restrict ourselves to 
finding clusters associated with our seed galaxies so that 
we can affirmatively associate a spectroscopic redshift 
with each cluster. The resulting cluster catalog includes 
cluster member lists and new membership probability es- 
timates Pnicm based on the full color model. With these 
in hand we can re-estimate the red sequence model as 
described in Section lOl 

As we iterate, the largest shifts in the model occur be- 
tween the first and second iteration, reflecting the shift 
from estimating membership probabilities based on a sin- 
gle color, and estimating membership probabilities with 
the full multi-color data. As detailed in Appendix [Xj for 
DR8 data the color at the reference magnitude mi{z) 
and slopes characterizing the average color of red se- 
quence galaxies converge quickly, and is generally well 
measured, except for u — g aX high redshift, where the 
large photometric errors in u make our model estimates 
noisy. The scatter model, on the other hand, converges 
slowly, particularly at high redshift, where the intrinsic 
scatter is often sub-dominant to photometric errors. As 
we now show, however, by the third iteration our model 
is well converged. 

We define convergence of the red sequence model in 
terms of the relevant quantity for our purposes, i.e., the 
cluster richness A. That is, we require that cluster rich- 
ness estimates be insensitive to further iterations. To this 
end, we have run the calibration through ten iterations. 
Given the red sequence model for each of these ten iter- 
ations, we estimate the photometric redshift and cluster 
richness of a standard set of galaxy clusters while fixing 
the central galaxy of these systems. Let then Ai and Zi 
denote the richness and redshift estimates from iteration 
i. We bin the clusters in narrow redshift slices (±0.01), 
and we calculate: 1- the median ratio A^/As, and 2- the 
median offset (A^ — A3) /as, where is the error estimate 
in the richness as estimated from iteration 3. 

In Figure[5]we show the results of these iteration checks 
for the first 6 iterations in the DR8 training region. Even 
for the first iteration, for which Pmcm was estimated using 
a single color, the bias is always < 10% (though at the 
lowest redshift that shift is ^ la). However, after the 
third iteration, the biases are always < 1% at low redshift 
and < 5% at high redshift. The bottom panel shows that 
after the third iteration the biases are < O.lcr. Thus, we 
rely on the output of our third iteration for our final 
cluster catalog. 
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Figure 5. Top: Average richness bias as a function of redshift 
for the first six iterations of the red-sequence model for the DR8 
training region, as compared to A3, the richness computed in the 
third iteration. Even for the first iteration, the bias is < 10% at all 
redshifts. After the third iteration, the biases are always < 1% at 
low redshift and < 5% at high redshift. Bottom: Error normalized 
average deviation relative to the baseline. After the third iteration 
the bias is always < O.lcr. 



7. PHOTOMETRIC REDSHIFT ESTIMATION 

At the end of our calibration we have a complete red 
sequence model as a function of redshift. Note, how- 
ever, that in order to estimate the richness of a photo- 
metric cluster we need to know the cluster redshift. If 
we have some initial, reasonably accurate redshift guess 
^init for each cluster, we can estimate the cluster richness 
and determine the high probability cluster members. We 
then simultaneously fit our red sequence model to all high 
probability cluster members to derive an improved red- 
shift estimate, and iterate this procedure through con- 
vergence. We now describe this full procedure in detail, 
including the construction of our initial cluster redshift 

guess Zinit. 

7.1. Redshift Initialization: z-^^d 

For the full SDSS DR8 survey, we have multiple photo- 
metric redshifts based on large training sets (e.g.,jCsab^ 
et al.|2 007: She ldon et al.|2012 ). However, these niethods 
have certain limitations. 1* irst, they require training sets 
that span a broad range of magnitudes, which although 
abundant at z < 0.5 for SDSS data, will be much sparser 
at higher redshifts for large surveys such as DES. Second, 
th ese methods — - in par ticular p{z) methods such as that 
of Sheldon et al. (2012) — are very good at estimating 



the ensemble of redshifts for a broad class of galaxies. 
However, our needs are much more specific: we wish to 
have a good initial single-value estimate of the redshift 
of the central galaxy of galaxy clusters to initialize our 
cluster photometric redshift estimation procedure. To 
that end, we have developed our own photometric red- 
shift estimator Zred which is specifically designed to work 
on red sequence galaxies. 

Given a red-sequence galaxy at redshift z with i-baiid 
magnitude m^, color vector c, and photometric error 
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Ccrriz), the probability distribution of its color is simply 

' (42) 



P(c) cx exp [ --X" 



where is given by Eqn. [oj i.e., 

= (C - (c|z,r71i)) (Cint(2) + Corr(z))"^ (c - {c\z,mi)) . 

. (43) 

The corresponding log-likelihood is therefore simply 
ln£ = —O.bx^- In practice, we also include an additional 
volume prior that accounts for the fact that there is more 
volume at higher redshifts. Assuming that the luminosity 
function does not evolve over the redshift uncertainties, 
the probability that a galaxy of a given luminosity is at 
redshift z is 



Poiz) « 



dV 



dz ■ ^ 

which leads us to the likelihood 



{l + zfD\{z)cH-\z), (44) 



Inured = - 



In 



dV 

dz 



(45) 



The redshift estimator Zjod is that which maximizes the 
above likelihood. We use the "red" subscript to indicate 
that the redshift estimator assumes a red sequence galaxy 
model. We maximize the likelihood along a redshift grid 
with 5z = 0.005, and then use parabolic interpolation 
to find the correct maximum. This search is restricted 
to galaxies with < m*(z) +2.5, since galaxies fainter 
than this fall well below the luminosity threshold us ed t o 
define cluster richness (recall m* (z) is defined in Sec. 4.2 ). 
The error estimate for z-^cd is estimated as the standard 
deviation of the redshift over its posterior, i.e. 



where 



^ j dz C,ediz)z"' 
J dz C^cdiz) 



(46) 
(47) 



We could, of course, store the posterior of the redshift 
distribution, but we have chosen not to do so since the 
only use of z^-cd in the redMaPPer algorithm is that of 
providing an initial redshift estimates for galaxy clusters. 

The top-left panel of Figure [6] shows Zrcd for DR8 clus- 
ter training galaxies with Pmom > 0.9 versus the spectro- 
scopic redshift of the corresponding central galaxy zqg ■ 
We see that Zj-^d performs very well, with low bias and 
scatter, and very few gross outliers. The "fiare-up" of 
the points around zqg ~ 0.35 is due to the 4000 A break 
moving from the g — r to the r — i color. 

The performance of Zicd is better illustrated in the 
bottom-left panel of the same figure. The black triangles 
show the mean offset z^od — ^CG in redshift bins, the blue 
dashed line shows the average error in Zj-od as estimated 
above, while the red-dashed line shows the observed rms 
of the redshift offset in each of the redshift bins. The 
magenta dotted line shows the fraction of 4(7 outliers. 
It is clear from the figure that our errors are somewhat 
overestimated, and that there is a small redshift bias in 

Zvcd • 

We correct for the deficiencies revealed in the left panel 
of Figure |6] by applying an afterburner. Specifically, for 



the above cluster sample we define the mean redshift 
offset as a function of redshift, 



dz{z) = {{z% - zcg)\zcg) 



(48) 



where zj?^,^ is the original, uncorrected redshift estimate 
defined above. That is, dz{z) is the curve traced by the 
black triangles in the bottom-left panel of Fig. [6j We 
define a corrected Zrcd redshift, as the solution to the 
equation 

^rod = 2°cd + "^^(^rcd) (49) 

In practice, the above treatment is slightly simplified, 
since our correction afterburner allows for the redshift 
bias to be a function of mag nitude. For details, we refer 



the reader to Appendix B.l 

In the right panel of Figure [6] we show the corrected 
value of Zi-od as a function of zcG after applying our after- 
burner, again for a sample of galaxies with Pmem > 0.9. 
The notation is the same as for the left panel. The bi- 
ases are improved at high redshift, although there are 
still some residual issues at z ~ 0.4 where Zrod is biased 
by ^ 0.3ct. We also note that the afterburner removes 
residual biases observed as a function of (not shown). 
The overall small bias and scatter in Zj-od allows us to use 
this photometric redshift estimate as a good initial guess 
with which to initialize our photometric cluster redshift 
estimator. 

7.2. Cluster Redshift Estimation: z\ 

Our approach to computing the cluster photometric 
redshift z\ is essentially an iterative extension of z^^d- 
Specifically, given a central galaxy candidate, we: 

1. Start with a cluster redshift z^.^, where i indexes 
the iteration. In the first iteration, we set z^.o — 

^red ■ 

2. Calculate the richness A around the candidate cen- 
tral galaxy setting Zdustcr = -za,!, and get the asso- 
ciated set of membership probabilities Pmcm- 

3. Select high membership-probability galaxies to es- 
timate a new redshift z\ ij^i by maximizing the like- 
lihood function given by Eqn. |5l| below. 

4. Repeat from step [2] until convergence, such that 
kA,»+i - 2A,d < 0.0002. 

All that remains then is the definition of a suitable 
likelihood function. To begin with, let us assume that we 
have a sample of known cluster member galaxies. Then, 
the log-likelihood of the observed colors for these galaxies 
would be 



xr inici 



(50) 



Eqn. 43 we take into account the log of the determinant 
of the covariance matrix, ln|C|. We have found that, 
unlike the case of Zred, including this term improves the 
performance of zx when the intrinsic scatter is varying 
rapidly. This makes sense, given that when utilizing mul- 
tiple galaxies, one can directly probe the scatter in the 
red sequence, which is an observable that is inaccessible 
when estimating single-galaxy photo-zs. 
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Figure 6. Left, top: Uncorrected photometric redsiiift Zj^d f™' cluster member galaxies in DR8 with pmcm > 0.9, as a function of the 
central galaxy spectroscopic redshift zqg- Left, bottom: The black triangles show the mean redshift offset Zj^d ~ ^cG in several redshift 
bins. The red long-dash line is the rms of these offsets, while the blue short-dash line is the average estimated redshift err or. The dotted 
magenta line is the fraction of 4cr outliers as a function of redshift. Right, top: Corrected photometric redshift Zjcdi using [49] for cluster 
member galaxies in DR8, as in left panel. Right, bottom: Bias, scatter, and outlier fraction, as in left panel, now for the corrected redshift. 



Of course, in practice, we do not have a list of known 
members, but rather a Ust of Ukely members with mem- 
bership probabihties. One might be inchned to adopt a 
sharp cut Pmom > Pmin in order to define a Ukehhood that 
can be used to estimate the cluster redshift. However, we 
find that a sharp cut in Pmom leads to numerical instabil- 
ities in the iterative process because galaxies can scatter 
in and out of the sample in the course of the iteration. 

To overcome this problem, we adopt instead a soft cut, 
and define a new likelihood 



w x' + ln|C| 



(51) 



where each galaxy contributes a weight w that smoothly 
varies from w = 1 at Pmom = 1 to w = at Pmcm = 0. 

The assignment of these weights is somewhat ad-hoc. 
We assume w(pmGm) follows a Fermi-Dirac distribution. 
The transition from w ~ to w — 1 occurs at p7o, which 
is the probability threshold that accounts for 70% of the 
total richness, i.e., 



0.7A = 



,om>P7 



(52) 



The advantage of defining the probability threshold in 
this way — as opposed to a redshift independent thresh- 
old pcut — is that j>7o varies with cluster redshift in such 
a way that one always uses the same fraction of clus- 
ter galaxies when estimating redshifts. Were we to take 
a constant Pmem cut, the number of galaxies contribu- 
tion to z\ would decrease with increasing redshifts, since 
galaxy Pmom values decrease as the photometry becomes 
noisier. The width of the distribution is set to 0.04, which 
we found is sufficient to regularize the iterative process. 



Thus, our galaxy weights are defined via 

exp [{p70 - Pmom)/0.04J + 1 

In Figure [7] we illustrate how the iterative process in 
our redshift estimate works. Fundamentally, each loop 
in the iteration takes a value for the redshift, and 
produces a redshift Zoutj and we wish to find the stable 
point where Zout — ^in- In the figure, we show 2:out(2;in) 
for three sample clusters. For the two typical clusters 
denoted with red short-dashed lines, this function is well 
behaved, and we quickly achieve convergence. However, 
there are also '--^ 1% — 2% of clusters that have conver- 
gence curves like the blue long-dashed line. These ap- 
pear to be projection effects betw een multiple nearby 
structures. As detailed in Section 9.3, redMaPPer of- 
ten fragments these clusters along the line-of-sight, as it 
should. However, which cluster is "dominant" and which 
is a satellite depends on the initial photometric redshift 
estimate (za,o)- 

Given an estimate for zx, we can also map out the 
posterior P{ztruc\zx). Defining xloim via 



norm 



InlCI 



( ^ ['^^ + In I C I 



we adopt the posterior 



P{ztvuc\zx) 



exp(-xLrm/2) \dV/dz\ 

exp(-xLrm/2) \dV/dz\ 



(54) 



(55) 



where dV/dz is the comoving volume per unit redshift. 
The above expression defines our estimate of the redshift 
probability distribution of each cluster. In addition, we 
fit this distribution with a Gaussian to estimate the red- 
shift error (7,, . 
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Figure 7. The redshift difference Zout — ^in for one loop of our 
iterative photometric redshift estimator, as a function of the input 
redshift Zin. Two typical, well behaved clusters are shown with red 
short-dashed lines. However, ^ 1 — 2% of clusters have convergence 
curves like the blue long-dashed line. These appear to be projection 
effects between multiple nearby structures. 



Finally, in order to ensure that Zx is unbiased, we apply 
an afterburner correction, much in the same way as was 
done for Zred, only now we demand that the redshift be 
unbiased in the sense that (ztrucl-ZA) ~ z\. We relegate 



the details to Appendix |B.2| 

In the top panel in Figure [8] we compare our photo- 
metric redshift estimates to the spectroscopic redshift of 
the central galaxy (where available) for all clusters in 
DR8 with \/S{z) > 20 (i.e., every cluster must have 20 
galaxy detections). The bottom panel shows the residu- 
als (red triangles), as well as the rms of the distribution 
(red long-dashed line) and average estimated error 
(blue short-dashed line). There are small biases that are 
nevertheless detected with high confidence. We do not 
yet fully understand the origin of these biases, but in- 
tend to return to this problem in a future paper. We see 
too that there is a feature at 0.35 ^ z 0.45, both in 
the bias and scatter, reflecting the additional difficulties 
introduced by the fact that the 4000A break goes from 
being sampled by g — r to r — i. This is also the red- 
shift range where we start running into the limit of the 
DR8 photometry, which further aggravates these failures. 
Indeed, these features are greatly reduced when redMaP- 
Per is run on dee per data (e.g., SDSS Stripe 82 coadds, 
Annis et al.||201l" not shown). 

One interesting thing to note about the top panel in 
Figure [s] is that the "large" {Az '-^ 0.1) redshift offsets in 
this plot do not reflect errors in the cluster redshift esti- 
mates, but rather cluster miscentering. That is, when we 
compare z\ to the redshift of the central galaxy, large off- 
sets are primarily due to our selection of a central galaxy 
that is not, in fact, a cluster member. To demonstrate 
this, we have created a "clean" sample of clusters where 
we demand that there be at least two spectroscopic clus- 
ter members with Pmem > 0.8 within 1000 km s~^ of the 
spectroscopic redshift of central galaxy, thereby ensur- 
ing that the central galaxy is in fact a cluster member. 
Of the 13,178 redMaPPer clusters in DR8 with spectro- 
scopic redshifts, 1,829 (or 14%) meet this criterion. The 
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Figure 8. Top: z\ vs. spectroscopic redshift of the assigned 
cluster central galaxy (CG) for redMaPPer clusters in DR8 with 
X/S{z) > 20. Bottom: Red triangles show the mean offset zx — 
ZQG in various redshift bins. The blue short-dashed line shows 
the average redshift error on z\, while the red long-dashed line 
shows the measured rms of the redshift offset distribution. The 
vast majority of outliers are due to errors in cluster centering, i.e., 
the offset zx — ZCG is large not because zx is incorrect, but rather 
because the chosen central galaxy is not actually a cluster member. 



corresponding comparison of zx to zqq in this case is 
shown in Figure [9j We see that this photometric redshift 
plot is very clean. The few outliers left (< 0.2%) are 
likely multiple systems in projection. In particular, the 
obvious outlier cluster at zx ~ 0.22 correpsonds to the 
cluster represented by the blue long-dashed line shown 
in Figure [7] 

We can get a better sense of the fraction of gross red- 
shift outliers from Figure [TOj where we show the fraction 
of 3cr, 4ct, and 5a outliers!~A cluster is considered an Na 
outlier if \zx — zcg\ > -^ct^a- To estimate the fraction of 
outliers as a function of redshift, for each redshift z we 
collect all clusters with redshift zx S [z — 0.025, z-l-0. 025], 
and directly measure the fraction of Na outliers. By 
moving the window [z — 0.025,z -t- 0.025] we recover the 
outlier fraction as a function of redshift. We see that 
w 1% of our galaxy clusters are Aa redshift outliers. We 
emphasize that this fraction is measured using the full 
cluster sample, not the cleaned version used to produce 
Figure 9j 

Finally, in Figure [TT] we test whether the redMaPPer 
estimates for the cluster redshift probability distribu- 
tions P{zti-uc\zx) are accurate. First, we select all clus- 
ters with spectroscopic central galaxies to create a "true" 
N{zcg), shown with a black solid histogram. We note 
that this is not representative of the full cluster popu- 
lation due to uneven spectroscopic sampling. We com- 
pare this to two estimates of N{z) using the same set 
of clusters. First, we bin clusters using the central val- 
ues of Zx, shown with the red-dashed histogram. Second, 
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Figure 9. zqq, vs. Z\ as in Figure [s] but demanding that the 
cluster contain at least two cluster members with pmom > 0.8 
with spectroscopic redshifts within 1000 km/s of the redshift of 
the assigned central galaxy. This removes clusters centered on 
non-cluster-member galaxies. Of the 13,178 redMaPPer clusters in 
DR8 with spectroscopic redshifts, 1,829 (or 14%) meet this crite- 
rion. The few remaining outliers (< 0.2%) appear to be redshifts 
failures from multiple systems in projection. 
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Figure 11. Comparison of the true and predicted N{z) distribu- 
tion for redMaPPer DR8 clusters with central galaxies with spec- 
troscopic redshifts and X/S > 20. We note that this is not repre- 
sentative of the full cluster population due to uneven spectroscopic 
sampling. The black solid histogram shows the "true" N(zcg)- 
The red-dashed histogram shows the results of binning the central 
values of zx for the same clusters, leading to obvious biases. The 
yellow band (ztlcr errors) shows the results of summing the cluster 
P{z) values, and provides a good fit to the data. 
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Figure 10. Fraction of redshift outliers, as a function of photo- 
metric cluster redshift. A cluster is said to be an Ncr outlier if 
\zx — zqgI ^ Nf^zx- show the fraction of Scr, 4(t, and Scr out- 
liers, as labelled. These are computed using the full redMaPPer 
cluster sample, with no additional spectroscopic requirements on 
member galaxies (unlike in Fig. |9|. 



we integrate J2 P(^truc\zx) over the appropriate redshift 
bins, shown with a yellow band (including the expected 
measurement errors and Poisson sampling, ±1ct). The 
red-dashed histrogram is obviously not a good fit to the 
spectroscopic redshift distribution. In particular, there 
is an artificial peak near the filter transition at z = 0.35. 
This is properly smoothed out by our probability distri- 
bution estimate (yellow band), which is a good fit to the 
spectroscopic data {x^/dof — 45.0/40). 



The issue of galaxy cluster centering is very impor- 
tant for constraining cosmology with photometric sur- 
veys. In particular, miscentered clusters are a leading 
source of systematic err or in stacked weak-lensing mass 
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as mean velocity disper- 
In addition, the cluster 
richness estimates themselves depend on the choice of 
center. Thus, a well-characterized centering model is es- 
sential for precision cosmology. 
We assume every galaxy cluster halo h as a bright, dom- 



inant galaxy residing at its center (e.g., von der Linden 
eraL][2007l [201^ ; Mena nteau et a l. ^OTst'Ma hdavi eFaT 
2012 



n^ lSong et al..2012bfPtotret al.,,2012, see also pa- 
ll). In our current implementation, we also assume 



per 

that the central galaxy is red, which is the case for the 
vast majority of massive clusters. The exceptions are 
strong cool-core clusters such as Abell 1835, where there 
is enough star formation for the broadband color of the 
central galaxy to no longe r be consistent with tha t of a 
red sequence galaxy (e.g., McNamara et al. 2006). Al- 



though blue central galaxies are mor e common ( although 



still rare) at the group scale (e.g., 


George et al. 


2011 


More et al. 


2011 


George et al.|2012 


Tinker etal.|2012 1 



scale at which this is an issue. 

Miscentering of galaxy clusters where the central 
galaxy is undergoing strong star formation is a known 
failure of the redMaPPer centering algorithm (see Paper 
II) . Simply removing the requirement that central galax- 
ies be consistent with the red sequence — i.e. relying 
solely on luminosity and proximity — can fix some of 
these clusters, but at the expense of miscentering ^ 10% 
of the clusters on foreground galaxies^ Likewise, our 



8. CLUSTER CENTERING 



We note that foreground galaxies are much more likely to be 
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tests have shown that both galaxy centroids and lumi- 
nosity weighted galaxy centroids result in worse center- 
ing properties than the al gorithm current ly implemented 
below (e.g., see also , George et al. 2012). Thus, center- 
ing on red galaxies is, as tar as we can tell, the "least 
bad" option. In its current implementation, the center- 
ing success rate is « 85% (see Paper II). We intend to 
continue working on improving our centering model for 
future data releases, as this is currently the dominant 
source of systematic failures in the redMaPPer cluster 
catalog. 

8.1. Basic Framework 

We introduce a fundamentally new way of thinking 
about identifying the central galaxy of a cluster: rather 
than specifying a unique cluster center, redMaPPer esti- 
mates the probability that a given galaxy is the central 
galaxy of the cluster. Some clusters have well defined 
cluster centers, exhibiting a single galaxy with a center- 
ing probability Pcen ~ Ij whereas others can have two or 
more reasonable central candidates, with the most likely 
center having Pcen ~ 50%. We note that these centering 
probabilities are the angular-position equivalent of the 
standard photo-z distributions P{z). That is, just as a 
cluster has an uncertain redshift position characterized 
by a redshift probability distribution, so too does the 
cluster have an uncertain angular position on the sky, 
characterized by the probability of any given galaxy of 
being the correct cluster center. The importance of this 
new way of treating cluster centering is that it opens up 
the possibility of a statistical treatment of cluster cen- 
tering akin to the statistical treatment of photometric 
redshifts, allowing us to improve our estimates of the 
cluster richness functions and cluster correlation func- 
tions. A detailed description of this framework will be 
presented and tested in a future work. 

The key insight that allows us to estimate centering 
probabilities is that there are three different types of 
galaxies in a cluster: a central galaxy ("CG"), satellite 
galaxies, and unassociated foreground and background 
galaxies. Let x be an observable vector for a galaxy, 
e.g., color (via z-^cd), luminosity (m^), and position of 
each galaxy. We define Ucon, "sat, and Mtg as the distri- 
bution of X for central, satellite, and background galaxies 
respectively. The Ucen and Usat filters are assumed to de- 
pend on cluster redshift and richness, while Ufg depends 
only on cluster redshift (via Zrod). We use the subscript 
"fg" as we expect foreground galaxies will be more likely 
to be misidentified as CGs. Given a galaxy with observ- 
able a;, the probability that it is the central galaxy of a 
cluster is 



Pccn{x\\,Zx) = Pfrc 



I \ I ' 



(56) 



where phcc is the probability that a galaxy has not been 
partially mask ed by a higher ranked cluster (as described 
in Section |973 typically pfreo ~ 1), and Agat = A — 1 is the 



total number of satellite galaxies. This formula can be 
thought of as the simple definition of probabilities, or it 
can be interpreted as a Bayesian classification algorithm. 

confused as centrals than background galaxies because they tend 
to be brighter in apparent magnitude 



Note, however, that the probability pccn is not the same 
thing as the probability Peon that the galaxy is the unique 
central galaxy of the cluster. By assumption, there can 
be only one central galaxy, so if galaxy i is the central 
galaxy, then every other galaxy j ^ i must not be a 
central. Consequently, the probability that galaxy i be 
the central galaxy of a cluster is 



^CGW ^ VccniS^i) J__j_(-i 7^ccn('^j))- 



(57) 



The proportionality constant is set by the condition that 
there is just one central galaxy in the cluster. 



1 ^ ^ PcQwiS^i) • 



(58) 



In addition to the central galaxy probability, we can 
also calculate the probability that a cluster is centered 
on a satellite galaxy, given by 



-^sat (f -^ccn) 



Asat^sat 
Asat^sat + Ufg, 



(59) 



All that remains for us to be able to estimate centering 
probabilities is the definition of the filters Wccn, i*sat, and 

Ufg. 

8.2. Centering Filters 

With the basic formalism laid out, we need to specify 
the observable x and the corresponding filters. There are 
three observables that we use to select the CG: the galaxy 
i-band magnitude m^; the red-sequence photometric red- 
shift Zrcd of the galaxy; and a weight w that characterizes 
the local cluster galaxy density around the proposed cen- 
tral galaxy. We also explored replacing our photometric 
redshift z^^d with , the "distance" in color space to the 
red sequence. However, we have found empirically that 
z-ccd works better for estimating central probabilities, in 
that small amounts of star formation and/or small color 
errors due to deblending have a much smaller impact on 
Zrod than they do on ■ We consider each of the filters 
in turn. 

8.2.1. Luminosity Filter: (j)ccn 

The magnitude of the CG is correlated with both rich- 
ness and redshift, so we define the CG magnitude filter 



0con(nT-i|"^nO-m) 



1 

' 27rcr„ 



exp 



{mi - niiY 



(60) 



where in principle both and Um depend on richness 
and redshift. In practice, we expect a^n to be roughly 
redshift independent, whereas rfii obviously depends on 
redshift. We assume that ifii traces m*(2;), so that the 
full richness and redshift dependent parameterization of 
fhi is 



m^{zx,\) = m.^(zx) + Ao + Ai In 



(61) 



where Aq and Ai are redshift independent constants, and 
Ap is the median richness of the sample. Our algorithm 
for fitting for Ag and Ai is detailed below. 
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8.2.2. Photometric Redshift Filter: Gccn(zrcd) 

For the photometric redshift filter, we use the red- 
sequence photometric redshift z-^cd for each galaxy in the 
field. We model this as a Gaussian function, with the 
form: 



Gccnl^rodkA) — 



1 



exp 



(62) 



As the error in the single galaxy photometric redshift 
dominates that from the cluster photometric redshift, 
we have set the scatter in G'cen(-ZrGd) to that of the in- 
dividual galaxy. In addition to the photometric redshift 
filter, we employ a hard cut such x^i^red) < 100. In- 
vestigations of DR8 spectroscopic galaxies have shown 
that galaxies with > 100 are all catastrophic out- 
liers in Zrod, which is not surprising considering the bad 
fit to the red-sequence template. By allowing galaxies 
with < 100, we allow some flexibility for galaxies that 
have slightly offset colors to still be considered as cen- 
tral galaxies. This is especially an issue for SDSS DR8 
for bright, nearby central galaxies that may have color 
shifts caused by deblending problems. 

8.2.3. Local Galaxy Density Filter: fccn{w) 

The motivation behind the local galaxy density filter is 
to define an observable w that is a pseudo-gravitational 
potential connecting each galaxy to every other cluster 
member. The weight w assigned to a given central can- 
didate is 



= In 



(63) 



where the sum is over all galaxies within the scale radius 
i?c(A) around the candidate central, Tc = 50/i~^kpc is a 
core radius used to soften the l/r dependence, Li is each 
galaxy's i-band luminosity, and Pmcm are the usual A 
membership probabilities. The denominator is chosen to 
make the argument of the natural log dimensionless, and 
to remove the obvious dependence of the numerator of w 
on the total number of terms in the sum. Normalized in 
this fashion, we expect that w does not scale with cluster 
richness nor redshift. 

We assume that for central galaxies, w follows a log- 
normal distribution fccn{w), 



/ccn(w) 



1 



27r cr. 



exp 



(ln(it;) 



2al 



(64) 



As noted above, we expect Wccn to be richness and red- 
shift independent. On the other hand, will certainly 
depend on richness. The noise in w should scale with 
raw galaxy counts (A/S*)^/^, where S{z) is the redshift- 
dependent factor that relates the raw-galaxy counts to a 
richness estimate when the survey is not sufficiently deep 
to reach 0.2L* at the redshift of the cluster (see Eqn. 22 ). 
For Poisson noise, we set 



A 



-1/2 



(65) 



where Cuj^cen is a constant that we fit for. As above, the 
pivot point Aj, should be chosen to match the median 
richness of the sample. 



With these definitions, the product 

^ccn — ^ccn 

[mi I ZA , A) Gccn {Zrcd ) /con {w\zx,\) (66) 

is the filter characterizing the distribution of central 
galaxies. 

8.2.4. Satellite Filter: Uaa.t 

Satellite galaxies on the red sequence can be described 
by a filter function analogous to Eqn. |66| Therefore, we 
have 

"sat = (^sat(TOi|A,m*)Gsat(^;rcd)/sat(wkA,A), (67) 

where fsati^) is defined in the same way as Eqn. |64[ ex- 
cept with parameters appropriate for the satellite galax- 
ies, Wsat and Cm^sat- The satellite luminosity function, 
0sat: is a Schechter function as described in Eqn. [8j The 
redshift filter Gsat('Zrcd) is identical to Gcon('Zrod)- 

8.2.5. Foreground Filter: Ufg 

The foreground filter is defined as the expected number 
of unassociated galaxies within the cluster radius Rc{X), 



Ufg = Sg,2(TOi,Zrod)/fg(w) 



nRi 



where E 
per z, 



g,z is the background density per deg per rrii 
d, calculated in a similar to fashi on a s the red 

In addi- 



sequence background described in Section 4.3 



tion, the area subtended by the cluster in Mpc must 
be converted to deg^ via the angular diameter distance 
(Ia, with cIa measured in Mpc/ deg. Finally, the /fg filter 
describes the connectivity filter from Eqn. |64| with pa- 
rameters appropriate for random points (iDfg and ctio, fg) 
as described below. 

8.3. Implementation 

Implementing this formalism requires that we calculate 
the parameters that describe the filters for central galax- 
ies, satellite galaxies, and foreground/background galax- 
ies. Of course, calibrating these parameters depends on 
having a training sample to start with. As usual, we ap- 
proach this problem in an iterative fashion, where the 
centering model is constrained at the same time as the 
redMaPPer red-sequence model. In the first iteration, we 
generate a catalog with roughly correct centering, and 
use this to provide an initial calibration of the filters. In 
subsequent iterations we make use of the centering fil- 
ters and use the output to recalibrate. This procedure is 
iterated until convergence. 

8.3.1. First Iteration and Initial Filter Calibration 

First, we implement a rough centering algorithm: for 
every cluster, we simply select the brightest high proba- 
bility (pmem > 0.8) member galaxy as the central galaxy 
of the cluster. In this fashion, we obtain a full train- 
ing catalog with a set of central galaxies that should be 
roughly correct. 

We now use this first iteration of a central galaxy (CG) 
catalog to determine the filter parameters that we will 
use in subsequent iterations. Note that because the ini- 
tial CG catalog contains some miscenterings, when cali- 
brating the CG filters it is important to account for this 
contamination. 
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If a cluster is improperly centered on a satellite galaxy, 
it is most often centered on the brightest satellite. Con- 
sequently, the luminosity distribution of satellites which 
are mistaken as central is not simply a Schechter func- 
tion (0sat)- Rather, the luminosity distribution of satel- 
lite galaxies in the CG catalog is given by 0i(mi|A,m»), 
the magnitude distribution of the brightest satellite in 
clusters of richness A. The expected magnitude and Zrod 
distribution of the galaxies in our CG catalog is 



p{mi,Z,cd\z) =Pcon?!'ccn("li)G(Zrcdk) 
+ Psat0l(f«i)G(Zrcdk) 



(1 - Pc, 



(69) 



where Pccn is the probability t hat the galaxy in question 
is the central galaxy, as in Eqn. |57[ and Pg^t is the proba- 
bility that the cluster is centeredon the brightest satellite 
galaxy. The redshift, z, is the spectroscopic redshift of 
the "seed" galaxy used in the training step. 

Our primary goal is to constrain the parameters Aq, 
Ai, and am- However, we also have the parameters Pccn 
and Psatj which are unknown in the first iteration. For 
these parameters, we have found that setting them in the 
first iteration at any reasonable initial estimate (Pcen G 
[0.7,0.9]; Psat e [0.05,0.2]) has no marked effect on the 
final calibration of the filter parameters. Therefore, for 
simplicity we set Pcen = 0.9 and Pgat = 0.05 for each 
individual cluster in this first iteration. 

Before we can continue, we must estimate the param- 
eters for (^i(mi|A,m*). This is modeled as a Gaussian 
distribution with central value rhgat = m*(z) + c^ -, + 
ln(A/Ap), where m*(2) is obtained from Eqn. 4.2 and 
Xp is the median richness of the sample. The width of 
the distribution is similarly modeled as 



fsat 



T,0i ln(A/Ap). 



(70) 



The central value must scale with richness because as 
we sample more galaxies from the luminosity function, 
we are more likely to find a very bright galaxy. In or- 
der to obtain these parameters, we run a simple Monte 
Carl o wi th the luminosity function parameters from Sec- 
tion l4?2j For a ~ —1.0 with Xp — 30, wc find that c^. = 
-O.g^S0, = -0.32, c^^^j = 0.40, and s,,,^, = -0.09. 
We note that these parameters depend only on a and 
Ap, and thus do not need to be updated in subsequent 
iterations. 

Finally, in order to constrain the parameters Aq, Ai, 
and (Tra, we define our likelihood based on Eqn. |69| 



ln£ = y^lnp(m^,Zi.cd|^)- 



(71) 



By maximizing this likelihood with respect to Aq and 
Ai we can use our training clusters to estimate the Wccn 
filter parameters. 

8.3.2. Calibrating the w Filters 

We now turn to calibrating the w filters. We begin by 
calibrating the foreground and satellite filters, f[g{w) and 
/sat(w). For these purposes we assume that all satellites 
follow the same spatial profile independent of brightness. 
Both fig{w) and /sat(w) can be calibrated in a Monte 
Carlo fashion. We note that foreground galaxies will 
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Figure 12. Histograms of w parameter for central (black solid), 
satellite (red long-dashed), and foreground/random (blue short- 
dashed) galaxies. It is clear that there is some power here to dif- 
ferentiate between central galaxies and satellites and foregrounds, 
but it is not perfect. However, there is some advantage in being 
able to reject a low connectivity bright galaxy as a likely interloper 
that does not fit the central galaxy model. 



uniformly sample the cluster area 7rP^(A), and thus to 
evaluate ffgiw) we draw random points uniformly within 
the disc of every training cluster, and compute the cor- 
responding f[g{w) parameters, iv[g and aw,ig- 

The satellite filter /sat {w) is computed in a similar fash- 
ion. First, for every training cluster we randomly select 
a cluster member with a probability p that is propor- 
tional to the membership probability. For this randomly 
selected member we compute w at the location of that 
satellite. After computing w for all the training clusters, 
we compute the corresponding /sat(w) parameters, uisat 
and (Tui^sat- 

We can now turn our attention to the distribution 
/ccn(w). Consider now the distribution /(tu) for all cen- 
tral galaxies. This total distribution is then 

f{w) = Pcon/cGn(u')+Psat/sat(w)-|-(l-Pccn-^'sat)/fg(w). 

The only unknowns in this equation are the mean Wcon 
and rms cr^,. con for the central filter, so we write the like- 
lihood 



h^ '^(^ccn;'-^tx;,ccn) 



ln/(w). 



(73) 



where the sum is over all training clusters. We then 
maximize t his l ikelihood to find Wcon and rms cru,,cGn- 

In Figure [12] we show the central, satellite, and fore- 
ground /(w) filters for the final training iteration. It 
is clear that there is some power here to differentiate 
between central galaxies and satellites and foregrounds, 
but it is far from perfect. In particular, satellites are only 
slightly less well connected than central galaxies. 

8.3.3. Subsequent Iterations 

As noted above, in our first iteration our catalog of 
CG galaxies is constructed using a simple centering algo- 
rithm: i.e., select the brightest high-probability member 
as the cluster center. In subsequent iterations of the clus- 
ter finder calibration we use the probabilistic centering 
algorithm described in Section [8?T] After application of 
our centering algorithm we have the important advantage 
that each cluster now comes tagged with Pcen and Psat- 
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Ther efore, we can no w repeat the cahbrations from Sec- 
tions 8.3.1 and 8.3.2| while using the correct Pcen and Psat 
for each individual galaxy. In this way we continuously 
improve our centering model with multiple iterations of 
the cluster finding algorithm. 

9. THE CLUSTER FINDER 

We have now described in detail all the ingredients that 
go into the redMaPPer cluster finder. Here, we focus on 
how these ingredients are blended within the context of 
the cluster finder to produce a catalog. In particular, we 
discuss how clusters are ultimately defined and perco- 
lated to ensure that every cluster is found once and only 
once. From a practical perspective, the cluster finding is 
broken into three stages. First, we look for overdensities 
around each individual galaxy using Zred as an estimate 
of the cluster redshift. Second, we calculate the cluster 
likelihoods for each of the galaxies that have a sufficient 
overdensity. Third, after sorting by cluster likelihood, we 
percolate through the full catalog while probabilistically 
masking out cluster members. 

9.1. First Pass 

In the first pass we wish to identify galaxies that are 
credible centers of galaxy clusters. This task involves a 
lot of data handling, and so we wish to make it as efficient 
as possible. 

We begin by taking every galaxy in the input cata- 
log with x^i^rcd) < 20 and brighter than 0.2L* at the 
red-sequence photometric redshift z^^d These are very 
generous cuts, yet they reduce the input DR8 catalog 
from 56 million galaxies to 23 million possible cluster 
centers in the redshift region 0.05 < z^cd < 0.6. Next, we 
take all galaxies within 0.5h~^ Mpc of a candidate cen- 
ter and measure A, setting the cluster redshift to Zrcd- 
Candidate centers with X/S < 3 a re re jected. The scale 
value S — 1/(1 — C) (from Section 5.1 ) is used to ensure 
that we have detected at least three red galaxies above 
the magnitude limit. This cut rejects a further ^ 60% 
of the catalog of candidate centrals. Finally, for all cen- 
ters tha t pa ss these cuts we calculate z\ as described in 
Section |7.2| to better refine the redshift of the possible 
cluster. 

9.2. Likelihood Sorting 

Given our list of possible clusters from the first pass, 
we now calculate the cluster likelihood for each of these 
clusters. The total likelihood is a combination of the A 
likelihood and the centering likelihood. To calculate the 
A likelihood, we first calculate the richness A using the 
optimized radial scale parameters with Rq ~ 1.0 Mpc 
and /3 = 0.2 as described in Section [4] The A likelihood 
is then given by 



A 



Pn 



0, 



(74) 



where A is evaluated at the cluster photometric redshift 
z\. 

Next, following Section [8] and Eqn. 66 the centering 
likelihood is given by 



In Cc 



In [0ccn (mi I ZA , A ) Gccn ( ^rcd ) fccn{w\zx,X)], ( 75 ) 



where we combine the luminosity, Zicd, and connectivity 
w of each galaxy. The total likelihood used in the ranking 
of possible cluster centers is 



InC ~ InCx + ln£c 



(76) 



Note that the amplitude of the of the A likelihood func- 
tion is typically much larger than that of the centering 
likelihood. Thus, to zeroth order, clusters are first ranked 
by A likelihood. Two candidate centrals with similar A 
likelihoods are then ranked according to the central like- 
lihood. As will be described below, we refine the choice 
of cluster center in the percolation step, so the initial 
centering likelihood is not especially influential in deter- 
mining the final cluster center. 

9.3. Percolation 

Having rank-ordered the cluster candidates according 
to likelihood, we now need to percolate the list to assign 
galaxies to clusters and ensure that no cluster is counted 
multiple times. The basic outline of the percolation pro- 
ceeds as follows. 

1. Given cluster number i in the list, recompute A and 
zx based on the percolated galaxy catalog. At the 
beginning of the percolation, the percolated galaxy 
catalog is simply the input galaxy catalog. 

2. Determine the cluster center and centering proba- 
bility via the method outlined in Section [Sj 

3. Perform a final calculation of A and z\ with respect 
to the new central galaxy. 

4. Update the percolated galaxy catalog by masking 
out galaxies based on their membership probabili- 
ties. 

5. Remove all lower-ranked possible centers that have 
a membership probability Pmcm > 0.5 of being a 
member of cluster i. Note these galaxies are still 
allowed to provide membership weight to lower- 
ranked clusters as part of the percolated galaxy 
catalog. 

6. Repeat Step 1 for the next cluster in the ranked 
list. 

9.3.1. Masking Galaxies 

Masking galaxies based on their membership probabili- 
ties is the "probabilistic percolation" step of the redMaP- 
Per algorithm. To perform this step, we keep track of the 
"total probability" that a galaxy belongs to a cluster, 
which we call ptaken- The probability pfroc = 1 - Ptakcn 
is the probability that the galaxy does not belong to any 
cluster. Initially, one has ptaken = and pfree = 1 for all 
galaxies. Upon finding a galaxy cluster, the entire galaxy 
catalog is percolated by updating the probability ptakcn 



Ptakcn,z-|-1 — J'takcn,^ 



In the case of the training runs, we take every "seed" 
at the spectroscopic redshift. 



(77) 

where Pmom is given by Eqn. [Tl 
Now, when we re-estimate the richness of cluster i + 
ilaxy 1, we must take into account the fact that some of the 
galaxies have a non-zero probability of belonging to a 
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cluster j < i + 1. We do so by modifying the richness 
calculation from Eqn. [2] via 



A 



PfrccPn 



(78) 



The first factor above is simply the probability that a 
galaxy is "free" to belong to the new cluster, and Pmcm 
is the standard membership probability from Eqn. [l] For 
instance, suppose a galaxy has a probability Pmem = 0.3 
of belonging to the first cluster in the rank-ordered list. 
In this case, the galaxy still has 70% of its probability 
to give to a cluster lower in the list. In practice, when 
quoting cluster membership probabilities Pmcm we report 
not the raw Pmcm value as given by Eqn. [l] but rather 
the product PfrooPmcm for that galaxy-cluster pair. That 
is, the reported value is the correct probability that the 
given galaxy belongs to the cluster under consideration. 
For galaxy clusters that are sparse in the sky (e.g., at 
high richness) these corrections are negligible. 

9.3.2. Extent of Clusters and Percolation Radius 

As noted above, cluster richness is measured within a 
radius Rc{X) that optimizes the signal-to-noise ratio of 
the richness measurements (R12), but is not in any way 
chosen to be related to standard definitions of the extent 
of a halo, say i?200c, the radius within which the average 
density is 200 times the critical density of the Universe at 
the redshift of the cluster. For cosmological purposes, it 
is useful to differentiate between the radius Rd^) which 
defines the richness measurement, and the percolation 
radius that is used to mask out cluster members and 
blend or deblend nearby systems. In particular, ideally 
one selects the percolation radius so that it matches as 
best as possible the percolation radii employed in the 
halo definition used to calibrate the corresponding halo 
mass function. 

In the appendix of R12 we used maxBCG clusters to 
obtain an approximate scaling of mass to A richnes^^ 
We found that the slope of the mass~A relation is consis- 
tent with 1, and that i?200c ~ 1.5i?c(A). Consequently, 
we have adopted 1.5i?c(A) as our default percolation ra- 
dius. That is, galaxies are masked out to this radius. 
We note that while galaxies outside the i?c(A) radius are 
not used in the summand in Eqn. [2j we can still esti- 
mate Pmem in cxactly the same way as we do with all 
other galaxies out to an arbitrary radius, which is how 
we implement the large percolation radius above. 

In practice, for the X/S > 20 richness threshold we 
have employed, changing the mask radius by ±50% has 
a very small impact on the resulting cluster catalog. Only 
a small number of clusters {'^ 5%) - primarily satellites of 
the richest A > 100 clusters - are affected at all by mak- 
ing this change. We expect to return to the question of 
what the optimal masking radius is in future work, par- 
ticularly within the context of cosmological constraints 
from galaxy clusters. 

10. THE REDMAPPER SDSS DR8 CLUSTER 
CATALOG 

We have run the redMaPPer cluster finding algorithm 
in the SDSS DR8 photometric catalog described in Sec.[2j 



^° As shown in Appendix IgI Acoi used in R12 is within 
of the multi-color A used in tms work. 
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Figure 13. Footprint of the redMaPPer DR8 catalog, with clus- 
ters binned into a Mangle simple pixelization scheme of depth 7. 
All clusters with A > 5 and zx S [0.1,0.3] are shown to better 
illustrate the large-scale structure in the catalog. 



The full cluster finder run contains all clusters with A > 
5S{z\) and z\ £ [0.05,0.6]. However, we have chosen to 
apply very conservative cuts to our catalog . The cuts 
we apply are: 

1. The richness is cut to A > 20S{z\). Roughly speak- 
ing, this requires that every cluster have at least 20 
galaxy counts above the flux limit of the survey or 
0.2L* at the cluster redshift, whichever is higher. 
From R12, we estimate that this results in an ef- 
fective mass cut of M200 ?J 10^^ Mq. 

2. The redshift range is cut to zx € [0.08,0.55], so as 
to minimize edge effects from the training sample. 

3. Only clusters with /,„ask < 0.2 are included (see 
Eqn 23), ensuring clusters are not overly compro- 
mised~Dy bad fields, bright stars, and survey edges. 



The resulting cluster catalog contains 25,236 systems. 
In Figure [13] we show the full footprint of the catalog. 
The color scale shows the density contrast relative to 
the mean cluster density, where red regions are denser 
than average and blue regions are less dense, as estimated 
using all A > 5, za € [0.1,0.3] clusters so as to give a 
better sense of the large scale structure in the survey. 
The image was produced by binning the catal og into a 
Mangle sirn ple pixelization scheme of depth 7 ( Swanson 
let al.l|20 08|)^ 

rPii 



In~FIgure[T4]we show the comoving density of redMaP- 
Per clusters with X/S{z) > 20 over the full redshift range 
of interest. The comoving density is roughly constant 
at z\ < 0.35 where the catalog is volume limited. At 
z\ ~ 0.35 the richness and redshift scatter are signif- 
icantly boosted by both the 4000 A break filter tran- 
sition and the magnitude limit of the survey reaching 
0.2L*. Therefore, the comoving density is boosted by 
low richness clusters scattering up into our sample. A 
full accounting for this scatter must be made in order to 
precisely calculate the redMaPPer abundance function, 
which we leave to future work. Above this redshift the 
magnitude limit starts to kick in (via the scale factor 
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Figure 14. Comoving density of redMaPPer DR8 clusters as a 
function of photometric redshift {zx) for clusters with \/S{z) > 
10,20,40. All densities have been computed by taking the sum of 
cluster p{z). The comoving density is roughly constant at zx < 
0.35, where the catalog is volume limited (denoted by the vertical 
dashed line). Above this redshift the comoving density falls off 
rapidly as the detection threshold rapidly increases. 



Figure 15. Two-dimensional histogram of A vs. zx for redMaP- 
Per clusters. The red dashed line shows the redshift-dependent 
richness cut of A > 20S{z). Although this figure generally shows a 
smooth distribution, the boost in low richness clusters at the tran- 
sition redshift of z = 0.35 apparent; this redshift is denoted by the 
vertical dashed line. 



S'(z)), and we only observe the most massi ve clusters. 
As an illustration of this effect, in Figure [15] we plot 
the richness A vs the photometric redshift z\ for the fi- 
nal redMaPPer catalog. The red dashed line shows the 
redshift-dependent richness cut A > 2QS{z). 

Finally, we show a sample redMaPPer cluster. In Fig- 
ure [Tel we show REDM J003208.2+180625.3, the richest 



redMaPPer cluster not found within the MCXC cluster 



catalog (Piffaretti et al. 2011 ), a system with A — 236±12 
at redshilt z\ = 0.396 ± U.U13. We note that this cluster 
is associ ated with a source in the ROSAT Bright Source 
Catalog ( jVoges et al. 19991. The specific data available 
for each of the clusters and members are described in Ap- 
pendix [H] and a detailed comparison of the redMaPPer 
clusters to X-ray and SZ catalogs is presented in Paper 
II. 

11. PURITY AND COMPLETENESS 

Purity and completeness can mean many different 
things depending on the context. There is a tendency 
to think of purity as the probability that a cluster in the 
catalog is a real cluster, and to think of completeness 
as the probability that a real cluster is in the catalog. 
However, it is often incorrect to think of these quantities 
as calibrating failure rates of the algorithm. Here, we 
adopt specific definitions of purity and completeness and 
discuss their implications. 

For cluster cosmology, the relevant quantity is the 
probability of detecting a halo of mass M with richness 
Aobs , which can be decomposed into a convolution of two 
components, 

P(Aobs|M) = J dAt™cP(Aobs|Atruc)P(Atruc|M). (79) 

The probability P(Atruc|-^^) is a feature of the Uni- 
verse, and must be properly marginalized over in any 
cosmological study that relies on the cluster number 
function. Constraining this probability distribution can 




Figure 16. Image of the cluster RM J003208. 2-1-180625.3, the 
richest redMaPPer cluster not found within the MCXC cluster 
catalog. This system has A = 236 it 12 at a redshift of zx = 
0.396 it 0.013, and is associated with a source in the ROSAT Bright 
Source Catalog. This particular cluster has three candidate cen- 
ters, denoted with blue boxes. 



also be s upplemented by u tilizing realistic mock cata- 
logs (e.g., [Song et al.|[2012a Wechsler et al., in prepara- 
tion) which we return to m future work. On the other 
hand, /'(Aobs|Atruo) is a feature of the cluster finding al- 
gorithm itself. This probability fully contains all of the 
information associated with measurement error in our 
catalog. In the present work, we define completeness 
and purity as specific integrals over this distribution. 

Purity and completeness, used as a simple 
parametrization of /'(Aobs|Atrue), can be estimated 
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in several ways. Perhaps the simplest consists of remov- 
ing galaxy clusters, randomizing galaxy positions, and 
then re-inserting galaxy clusters. The cluster finding 
algorithm can then be rerun, and one can determine 
which clusters are detecte d, and how many "false' 



clusters are detected 
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Kozo et al. (2U11), such an algorithm is fundamentally 



hawed because background galaxies are not uniformly 
distributed. Consequently, we take a somewhat different 
approach, as described below. 

1. Generate random points: We generate a list of ran- 
dom points uniformly sampling the input galaxy 
mask. This ensures we sample all the systemat- 
ics in the survey, as well as the effect of masked 
regions. 

2. Sample "true" cluster richness and redshift: Using 
the full cluster catalog (A > 5), we randomly sam- 
ple galaxy clusters to generate pairs of parameters 
(Atruc^truc)- This cusurcs our model cluster distri- 
bution has the same richness and redshift distribu- 
tion as the final catalog, including covariances. 

3. For each pair of sampled values (Atruoj-^truo)) 
sign them a spatial location using a random point 
from Step 1, and sample galaxies using the clus- 
ter mod el: Using the same method as in Sec- 
tion |5.2 we use Monte Carlo sampling to gener- 



ate 5000 galaxies with the model radial and lumi- 
nosity profiles. From this sample of 5000 cluster 
galaxies, we randomly sample Atruc galaxies from 
within i?c(Atruc), as well as fcAtmc galaxies from 
-Rc{Atrue) < r < i?c(2Atruc), whcre k scales with 
Atrue as appropriate for the radial profile. This en- 
sures that our fake clusters do not have artificial 
hard edges. 

4. Measure Aobs for the generated fake cluster at the 
random location, and repeat 100 times. When mea- 
suring Aobs, we mask out galaxies according to the 
bright stars and edges in the survey mask, as well 
as applying any necessary magnitude limits. We 
do not, however, make corrections for higher order 
effects such as blending of galaxies. 

In this way, we generate a map over the full sky of the 
detectability of clusters as a function of redshift and rich- 
ness, while taking into account the large-scale structure 
that is already imprinted on the galaxy catalog. This 
is a more stringent test than a random-background test, 
but still does not capture the additional effect that cor- 
related large scale structure can have on the galaxy clus- 
ters. However, we expect that these additional effects 
are subdominant: while the correlation length of galaxy 
clusters can be as large as ~ 20 Mpc, the typical length- 
scale over which projections are effective is ^ 100 Mpc 
or more, so for most of this volume I + ~ 1. Roughly 
speaking, we would expect no more than 20% corrections 
to our estimated impurity from these effects, so for exam- 
ple, if 5% of our clusters suffer from projection effects in 
this analysis, it likely that this fraction is underestimated 
by by ~ 0.2 x 0.05 = 1%. A more detailed treatment of 
projection effects will be presented in a future work. 



In Figure[T7]we illustrate how we use the above outputs 
to define purity and completeness. The figure shows the 
expectation value (Aobs) for the observed richness of a 
galaxy cluster vs. Atrue for a narrow redshift slice (0.2 < 
z < 0.22). Note that although Atrue is a fixed value, each 
cluster has a distribution of Aobsi and we have plotted 
the mean value. 

To define completeness, consider the sub-sample of 
galaxy clusters in some richness bin in Atrue, e.g., that de- 
fined by the vertical red short-dashed lines in Figure [TT} 
The bulk of this cluster sample falls within a tight locus 
around the Aobs ~ Atrue line, with some noise associated 
with background fluctuations. The mean relation can 
be measured, including its scatter, using fitting methods 
robust to outliers (we rely on median statistics) . The di- 
agonal red short-dashed lines show the ±4cr scatter, and 
points outside this region are gross outliers. We see that 
all such outliers fall above the main cloud of points: these 
are projection effects, where we placed a fake galaxy clus- 
ter atop an existing richer structure. The completeness 
c( Atrue) is defined as the fraction of the non-outlier points 
to the total number of clusters in the richness bin, i.e., 
it is the ratio of the number of clusters within the red 
dashed parallelogram (Aobs is consistent with Atrue within 
the scatter) to the number of clusters in the Atrue biu. 

Note that with this definition c(Atruc) < 1 does not 
imply that we are missing clusters. Instead, it is simply 
estimating the fraction of clusters at a given Atrue that 
suffer from severe projection effects. Similarly, for clus- 
ters with Atrue near the detection threshold, a fraction 
of these clusters with have Aobs less than the detection 
threshold. Thus, these clusters are only "missing" due 
to well understood observational scatter. 

Similarly, we can estimate purity by considering clus- 
ters in a bin in Aobs, e. g., that defined by the blue long- 
dashed lines in Figure [17] Several of the clusters in this 
bin are clear outliers compared to their corresponding 
Atrue- The fraction of such outliers in the Aobs bin is the 
impurity. However, we note one additional restriction; 
that is, we discard all outliers with Aobs > Atrue/2, de- 
noted by the grey region in the figure. We note that in 
any projection effect, Aobs = Amain + Aprojj the richness of 
the main and projected halo respectively. By definition, 
the main halo has a richness Amain > Aobs/2, and we are 
concerned with calculating the purity of main halos only. 
Thus, any fake cluster with Atruo < Aobs/2 is necessarily 
a projection on a real, significantly richer cluster in the 
catalog and should be discarded in this analysis. As with 
the completeness calculation, we emphasize that the re- 
sulting purity is the fraction of galaxy clusters as a func- 
tion of the observed richness that suffer from projection 
effects, and does not represent an absence of galaxies. 

To formalize all of the above discussion, we define the 
completeness in a bin of richness Atruo as 



J2 ^^ohs -P(Aobs|At 



completeness 



in Atruc bin 



A^(Atrue) 



(80) 



where the sum is over all clusters in a given bin of Atrue- 
We define Aq = Atrue — 4(t, with the restriction that 
Ao > 20; Al = Atrue + 4ct; = crf^t + erf; and iV(Atrue) 
is the total number of clusters in the bin. We estimate 
CTint directly from the output as the intrinsic scatter in 
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Figure 17. Expectation value of the measured richness ((Aobg)) 
vs. input richness (Atrue) for simulated clusters in the narrow red- 
shift range 0.2 < 2 < 0.22. Note that although Atrue is a fixed 
value, each cluster has a distribution of X^hs and we have plot- 
ted the mean value. To measure completeness, we consider the 
sub-sample of clusters in a richness bin in Atrue, defined by the 
vertical red short-dashed lines. While most of the cluster sam- 
ple falls within a tight locus around Xoha ~ Atrue, there are some 
cluster that fall above the 4cr contours defined by the diagonal 
red short-dashed lines. These outliers are projection effects, where 
we placed a fake cluster atop an existing richer structure, and are 
counted toward incompleteness. To measure purity, we consider 
the sub-sample of clusters in a richness bin in Aobsi defined by the 
blue long-dashed lines. The clusters that are significant outliers 
with low Atrue are impurities where the measured cluster is the 
result of a projection effect of multiple systems. The grey region 
denotes "unphysical" projections where Aobs ^ 2 Atrue, and as such 
the fake cluster with richness Atrue is the secondary rather than 
the primary halo. 



the Aobs~'^truc relation, and we have chosen to define an 
outher (incomplete) cluster as any cluster that has a mea- 
sured richness Aobs that is more than Aa discrepant from 
its true richness Atrue- 
By the same token, the purity is defined as 



rue J 



purity 



in Anbs bin 



1 A„hs bi 



■fxabs l '^•^°t>s ^'(Aobs|AtruG) 



(81) 



where now the sums are over all clusters in a given bin of 
Aobs- Note that we have the additional restruction that 
the sum is restricted to systems with Atruo > (Aobs) /2- 
We define Aq = Atrue - 4(Tint and Ai = Atrue + 4crint as 
before, and Aobs.i and Aobs, 2 are the extent of the richness 
bin in quest ion . 

In Figure [18] we show the completeness and purity as 
a function oirichness for five redshift bins for the DR8 
galaxy and cluster catalog. At low redshifts {z < 0.3) 
the completeness is essentially > 99% at A > 30, but 
falls below this threshold due to clusters randomly scat- 
tering in and out of the selection threshold. At higher 
redshift, as we encounter the magnitude limit of the DR8 
catalog our richness threshold increases and thus the rich- 
ness at which these threshold effects come into play also 
increases. 

Our purity is > 95% for all richness and redshift bins, 
with the richest systems being less pure. This can be un- 
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Figure 18. Top panel: Completeness as a function of input rich- 
ness, Atrue, in five redshift bins for the DR8 catalog. At low red- 
shift, the completeness at A < 30 falls off as measurement errors 
scatter clusters in and out of our A > 20 selection threshold. At 
higher redshifts, the selection threshold increases, as does the mea- 
surement error, leading to a broader decrease extending to higher 
richness values. Bottom panel: Purity — i.e., fraction of galaxy 
clusters not affected by projection effects — as a function of mea- 
sured richness, Aobsi in five redshift bins. 



derstood very simply: consider a chance superposition 
of two clusters of richness A leading to a single detec- 
tion of richness 2A. This factor of two shift has a much 
more dramatic impact on the overall abundance function 
at the rich end, simply because the richness function is 
steeper there, i.e., a constant projection fraction in Atruo 
translates into a projection fraction that decreases with 
Aobs- Again, all these "impurities" actually correspond 
to real clusters; it's just that the observed richness has 
been systematically overestimated. 

One curious feature of our purity is that it seems to in- 
crease with decreasing richness, and with increasing red- 
shift. This is a consequence of our definition: at lower 
richness and higher redshifts, the measurements errors 
in the richness are larger, so a cluster that is a Aa out- 
lier needs to be more and more of an extreme projection, 
which makes such 4tT outliers rarer. That is, the pu- 
rity increases not because there are fewer projections, 
but rather because the projections that do occur become 
increasingly less important relative to the observational 
errors in the richness estimates. 

12. CLUSTER A/IASKS 

One of the great advantages of using model clusters 
placed randomly on the real sky is that we can use the 
same output to map the detectability of redMaPPer clus- 
ters across the entire survey. In this way, we can directly 
construct a set of random points directly applicable to 
the cluster mask, which is not the same as the galaxy 
mask that defines the survey. An appropriate set of ran- 
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dom points is essential for cross-correlation s tudies for 
cluster cosmology (e.g., |Landy fc Szalay||1993" ). 

As an illustration of tne difference between the galaxy 
mask that defines the survey and the cluster mask that 
defines the redMaPPer catalog, we have run a dense sam- 
ple of random points in the vicinity of Arcturus using the 
methods described in Section [TT] This very bright star 
contaminates the SDSS photometry over a large area, 
and thus effectively masks out a region of the sky that 
is 0.8° in radius. To isolate the effect of the survey 
mask, all the random points have the same true rich- 
ness Atruc — 40, with a redshift distribution appropriate 
for redMaPPer. 

In Figure [19] we show the map of the detectability of 
Atruo = 40 clusters in a 4° x 4° region around Arcturus. 
Each pixel shows the frac tion of time a sample cluster 

where black is 0% and white 
< z < 0.2, up- 
40 clusters is 



80 



is detected, using Eqn. 
is 100%. In the low redshift bin (0.1 
per left panel) the detectability of Atruc 
essentially 100% outside the Arcturus mask, except for 
a few pixels around bright Tycho stars. Note, however, 
that due to our requirement that the area of a cluster 
must not be significantly masked out (/mask < 0.2), the 
edge for the detectability of a cluster at these redshifts is 
slightly farther from the center of the Arcturus than the 
edge of the galaxy mask (denoted by the red dashed line) . 
At higher redshift these edges drift closer to each other as 
the angular extent of the clusters decreases. However, in 
the highest redshift bin (0.4 < z < 0.5, lower right panel) 
the cluster is only detected ~ 60% ±30% of the time (see 
Figure 18) due to the survey depth. The detectability 



varies significantly when clusters approach the threshold 
and we see a strong dependence on the local depth and 
structure. 

13. SUMMARY 

In this paper, we have introduced redMaPPer, a red- 
sequence cluster finder that is designed to make optimal 
use of large photometric surveys. As a case study in 
the implementation of the algorithm, we have run on 
the SDSS DR8 photometric catalog. We have shown 
that redMaPPer improves significantly on previous clus- 
ter finders (see also Paper II), with many features that 
will be required to take advantage of upcoming surveys 
such as DES and LSST. In particular: 

1. redMaPPer is based on a multi-color extension of 
the optimized richness estimator A, which has been 
shown to be a good mass proxy (R12 and Paper II). 

2. redMaPPer is self-training, with a modest require- 
ment in the number of training spectra, which can 
themselves be limited to the brightest cluster galax- 
ies. This makes it particularly well-suited to high- 
redshift surveys. Furthermore, the multi-color red- 
sequence model makes optimal use of all color data 
at all redshifts, with no sharp features as the 4000 A 
break transitions between filters. 

3. redMaPPer can handle complex survey masks. 
Both mask-corrected richness values can be com- 
puted, as well as cluster- appropriate random point 
catalogs for large-scale structure studies. 




Figure 19. Map of detectability of redMaPPer clusters of Atruo = 
40 in the region of the DR8 galaxy mask in the vicinity of Arcturus 
(red dashed circle). Each panel is 4° on a side. At low redshift 
(0.1 < z < 0.2, upper left panel) a cluster will be detected es- 
sentially 100% of the time, except when it falls directly on top 
of a star (including typical Tycho stars, which show up as small 
black regions in the plot). Note that due to our requirement than 
no more than 20% of the area of the cluster is masked out that 
the effective mask from Arcturus is slightly broader than that of 
the survey mask. At higher redshift this effect is smaller because 
the clusters subtend a smaller physical region on the sky. In the 
0.4 < z < 0.5 bin (lower right panel) th e cluster is only detected 
~ 60% ± 30% of the time (see Figure [Tsl. 



4. All clusters are assigned a redshift probability dis- 
tribution P{z), which enables a more accurate re- 
construction of the redshift distribution of the clus- 
ter population relative to simple point-redshift es- 
timates. 

5. The centering of clusters is fully probabilistic. In 
this way, the uncertainty in the position of the clus- 
ter can be handled in an analagous way to the red- 
shift uncertainty provided by P{z). 

6. The algorithm is numerically efficient, and can be 
run on large surveys with modest computing power. 

Using the red-sequence model derived in the redMaP- 
Per calibration phase, we have derived two red-sequence 
based photometric redshifts. The first, Zrod, is a red- 
sequence template-based photo-z, that has been designed 
to generate a good "first-guess" estimation of the redshift 
in each cluster. We have also shown, in Appendix|D] that 
Zrcd compares very well to existing DR8 photometric red- 
shifts for this specific class of galaxies. However, Zrod has 
the advantage that it requires many fewer spectroscopic 
training galaxies. Moreover, these galaxies can be the 
brightest galaxies in the clusters, with no penalty to the 
performance of Zrcd at the faint end of the galaxy sam- 
ple. The second, z^, is a very precise photo-z derived 
from fitting all cluster members simultaneously to the 
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red sequence model. In addition, we derive a P{z) esti- 
mator for z\, which we show is superior to point-based 
photometric redshifts for the purposes of estimating the 
redshift distribution of the galaxy clusters. In DR8, this 
is especially true in the region of the filter transition at 
z - 0.35. 

As a case study in the implementation of the algo- 
rithm, we have run redMaPPer on the 10,400 deg^ BOSS 
region from the SDSS DR8 photometric catalog. Using 
red galaxy spectroscopic redshifts from 1/5 of the total 
area from z e [0.05,0.6], we are able to constrain a ro- 
bust red-sequence model that defines both the richness 
and photometric redshift estimators. The photometric 
redshifts, z\, have small bias and low scatter, ranging 
from cr^ = 0.006 at z ~ 0.1 to cr^ = 0.020 at z - 0.5, 
due to increased photometric noise near the survey limit. 
The rate of catastrophic outliers is low, with only ^ 1% 
of galaxy clusters appearing as Aa outliers. Note that be- 
cause of our high photo-z precision, a cluster at z = 0.1 
with a redshift offset as small as Az = 0.025 is con- 
sidered a catastrophic redshift failure. Furthermore, we 
show that the majority of these outliers are bad centers 
rather than bad redshifts; when the catalog is cleaned by 
demanding that central and satellite galaxies with spec- 
troscopy must all be within 1,000 km/s, the failure rate 
decreases to < 0.2%. 

After running redMaPPer on the full DR8 photomet- 
ric catalog, we apply a conservative selection cut of 
X/S{z) > 20, for a total of 25,236 clusters in the redshift 
range of z £ [0.08,0.55]. As shown in paper II, the co- 
moving density of redMaPPer clusters satisfying this cut 
it lower than that of all other SDSS photometric cluster 
catalogs. The scale factor, S{z), given by Eqn. 22 defines 
the correction factor on the richness caused by the survey 
depth. The catalog is volume-limited at z < 0.35, where 
S = 1 and the survey depth is brighter than the fiducial 
luminosity cut of 0.2L^, used by the A richness. Because 
our selection threshold corresponds to a total of 20 galaxy 
detections, as we lose galaxies at high redshift due to the 
magnitude limit of the survey, these 20 galaxies must all 
be due to bright members. Therefore, the correspond- 
ing richness threshold of 20/S'(z) is much higher. This 
increased detection threshold results in fewer galaxy clus- 
ters at high redshifts. Our adopted richness threshold of 
20 detected red sequence galaxies is chosen to provide the 
most robust cluster catalog possible, with a mass thresh- 
old of M > 10^** Mq where our catalog is volume limited 
at z < 0.35 (R12, Paper II). Although the full redMaP- 
Per catalog extends to lower richnesses, we expect expect 
performance will worsen as one moves towards lower and 
lower richness thresholds. 

Finally, we investigate the purity and completeness of 
our cluster finding algorithm, focusing on the observa- 
tionally relevant probability distribution P(Aobs|AtruG)- 
We have defined impurity and incompleteness as the frac- 
tion of clusters for which the observed richness Aobs is 
significantly different from the true richness Atruo- These 
outliers are caused by projection effects: when two halos 
are merged together, this manifests itself as incomplete- 
ness — a cluster with richness Atrue is up-scattered sig- 
nificantly, so it is "missing" from where it should have 
been — or impurity — the richness Aobs of such a clus- 
ter is significantly overestimated. We note that while 



the completeness of redMaPPer is near 100%, the purity 
is ~ 95% at the rich end, increasing at lower richness. 
This decrease simply reflects larger observational error 
(in a proportional sense) for lower richness clusters: i.e. 
"outliers" become more rare not because projection ef- 
fects are less rare, but because projection effects become 
sub-dominant to observational uncertainties. Our esti- 
mate of the incidence of projectio n effects is thus ^ 5%, 
similar to what was estimated in Rozo et al. (20111. A 



more detailed analysis of projection ettects tor redJVIaP- 
Per clusters will be presented in a future work. 

In Paper II, we present a detailed comparison of the 
redMaPPer cluster catalog to various X-ray and SZ cat- 
alogs with high quality mass proxies. In all cases, we 
show that the redMaPPer richness A is a low scatter 
mass proxy with high completeness and low impurity 
compared to these "truth" tables. We also compare the 
performance of redMaPPer to other photometric cluster 
finders that have been run on SDSS data, and show that 
redMaPPer outperforms these other algorithms in all 
metrics (e.g., photo-z performance; mass scatter; and pu- 
rity and completeness) , though some do perform equally 
well in subsets of these categories in specific redshift 
ranges. 

While this present work has focused on the application 
of redMaPPer to the SDSS DR8 catalog, we emphasize 
that this algorithm was developed specifically for upcom- 
ing large photometric surveys such as DES and LSST. In 
particular, its ability to simultaneously utilizes all avail- 
able photometric data, its smooth handling of the filter 
transition of the 4000 A break across filter passes, and its 
ability to self-calibrate using only minimal spectroscopic 
training samples of bright cluster galaxies are all specifi- 
cally designed to enable cluster finding in these new pho- 
tometric data sets. This will be especially advantageous 
at z > 0.7 in the Southern Hemisphere, where we do not 
have the advantages of more than a decade of survey data 
from the SDSS spectrograph. Thus, in short order we ex- 
pect redMaPPer will be capable of producing large, high 
quality catalogs of ~ 80,000 clusters at z < 1 with DES, 
opening a new era of high redshift cluster cosmology. 
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APPENDIX 

A. ITERATIVE EVOLUTION OF THE MODEL 

CALIBRATION 

For illustrative purposes, in this Appendix we collect 
the figures illustrating the how the red sequence model 
evolves as we iterate our calibration procedure. In Fig- 
ure [20] we show the red sequence parameters character- 
izing the mean color — i.e., c(z) and s{z) — for each of 
the firs t 3 iterations of our red sequence calibration, while 
Figure [2l] shows the corresponding diagonal elements of 
the covariance matrix characterizing the intrinsic scat- 
ter in each of the colors. As noted in Section 16.51 and 
Figure [5] three iterations are sufficient to achieve con- 
vergence in the redMaPPer catalog in terms of richness 
and redshift estimates. 

B. PHOTOMETRIC REDSHIFT CORRECTION 

PARAMETERS 

B.l. Constraining Zred Correction Parameters 

Our approach to constraining the Zred correction pa- 
rameters Cz{z) and Sz{z) is similar to that employed for 
the red sequence calibration (note the z subscript). As 
before, we have chosen to constrain these parameters at 
a given node spacing, using cubic spline interpolation be- 
tween the nodes. The node spacing we have chosen for 
DR8 is 0.05 for Cz{z) and 0.10 for Sz{z), suited to the 
characteristic variation scales. 

One significant complication that we have to deal with 
is that we have membership probabilities for all the 
galaxies. In order to properly make use of the proba- 
bilities, as in Eqn. 33] we need to know the background 
PDF. Unfortunately^ there is no first-principle way of 
calculating the z-^cd background as a function of ztruo- 
Therefore, we have chosen to assume the background is 
a Gaussian function with zero mean and finite width, and 
to marginalize over this background as a set of nuisance 
terms. As above, we assume the background width, ab{z) 
is a smoothly interpolated function with a node spacing 
of 0.10. To ensure that we are calculating the correction 
factors appropriate for red galaxies, and not blue cluster 
members and interlopers, we limit ourselves to galaxies 
with pmom > 0.7. 

Given a model correction, 

{Cz\ztYuc,mi) = Cz(ztruc) + (ztruo) ["^i " fni{z){ztruc)], 

(Bl) 

then we have a Gaussian PDF for the true galaxies, 
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The total likelihood is then 

£ = w[PmcmGi + (1 - Pmcm)G2], (B4) 

where we have made the addition of a weight function, w, 
which is a smooth function of that de- weights galaxies 
with large and are possible outliers. The weight w is 



1 



cxp[(x2-xi5)/0.2] + l' 



(B5) 



where X95 is the 95 percentile of all galaxies with 

Pmem > 0.7. 

As before, we find the Cz{z), Sz(z), and ab{z) pa- 
rameters by maximizing ^ C using the downhill-simplex 
method. 

B.2. Constraining z\ Correction Parameters 

Our approach to constraining the zx correction param- 
eters is an alogo us to that used for the Zj-cd parameters in 
Appendix |B.1[ However, our job is a little easier be- 
cause we are applying corrections such that (ztruokA) is 
unbiased rather than the converse. Therefore, the cor- 
rection term can be a function of zx. For DR8, we use 
a cubic spline interpolation with node spacing of 0.04 
for {czti^^Jzx). In addition, we allow an additional vari- 
ance term as we find that our raw zx errors are under- 
estimated. For tTj,^ we use a smooth function with a 
node spacing of 0.10. To ensure that we are using well- 
measured clusters, we limit ourselves to calibration clus- 
ters that have X/S{z) > 10, where S is the scale factor 
defined in Eqn. |22[ Essentially, this limits us to clus- 
ters with at least 10 red galaxies above the luminosity 
threshold or magnitude limit. 

Given a model correction {cz-i^^Jzx) and intrinsic scat- 
ter correction CTz;i,int, we have a Gaussian PDF for the 
clusters, 
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-[{zx - zcg) - {Cz,,^Jzx} 

(B6) 

where trtot = + cr^^ j^^^ . The total likelihood is then 

given by In £ = ^ In G. As before, we find (cztr^Jzx) and 
'^zx int maximizing this likelihood using the downhill- 
simplex method. 

With this parametrization in hand, we can calculate 
the corrected zx and error as zx = zx^ra-w + (cztr^Jzx), 
and — cr^^^raw + '^z int- However, we find that after 
applying these corrections there may still be small resid- 
uals in the training sample. Therefore, we iterate on this 
solution two further times to obtain a final corrected red- 
shift Zx- 

After the calibration is complete, we must also ap- 
ply these corrections to the P{zt,-uc\zx) estimation for 
each cluster. To replicate the zx offset represented by 
(cztiuckA), we first offset the central value of the P{z) 
distribution. Next, to replicate the increased scatter we 
"expand the space" between the P{z) bins, so that a 
Gaussian fit to P{z) will measure the same width as the 
corrected (Tz^ value. We find that this does an adequate 
job of maintaining asymmetries in the P{z) distribution 
which show up near the filter transitions. 
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Figure 20. Top 4 panels: Average color, c(z), at the pivot magnitude fni{z), for the first (red dotted line), second (blue dashed line), and 
third (magenta solid line) iterations of the calibration as a function of redshift. Bottom 4 panels: As top panels, but for the red sequence 
slope 5(2:). 
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Figure 21. As Figure 20 but for the intrinsic scatter 



as a function of rodsiiift. 



C. HOW MANY TRAINING CLUSTERS? 

When calibrating the red sequence in Section|6[on DR8 
data, we make use of all the spectroscopy available in our 
2000 deg training region. However, much of this is su- 
perfluous. First, most of the spectroscopic galaxies — 
even the LRG samples — are not in massive clusters. 
Second, our strategy of leveraging central galaxy spec- 
troscopy to all the galaxies in a cluster means that we do 
not require thousands of clusters to perform the calibra- 
tion. In this section, we investigate how many training 
clusters — each represented by a single spectroscopic red- 
shift for the central galaxy — are required to create an 
accurate and unbiased richness and redshift estimate. 

To test the number of required training clusters we 
follow the method of Section 16.51 to measure the bias in 
the recovered richness and photometric redshift values on 
a predetermined set of test clusters. For our test suite, we 
select the {5,10,20,40,80} richest clusters per redshift bin 
of ±0.025 in the training region. The redshift binning is 
used to ensure we have a relatively uniform coverage over 
the redshift range of interest. In practice, of course, the 
training clusters need not be so uniformly sampled. For 
each of these spectroscopic training sets we recalibrate 
the red sequence and measure the richness A and redshift 
z\ for each of the test clusters from Section |6.5[ 

In Figure [22] we show the results of these test runs. 



The left panel shows the richness bias and significance of 
the bias as a function of redshift for the various training 
samples. Although we can get a reasonable calibration of 
the red sequence with as few as five spectra per ±0.025 
redshift bin, the resulting richnesses are significantly bi- 
ased (~ la) at the transition redshift z ^ 0.35. In order 
to achieved unbiased richness estimates (< 0.3ct) then we 
require ~ 40 clusters per redshift bin. We assume any 
residual biases are due to the noise in estimating the off- 
diagonal elements of the covariance matrix. This results 
in a total of ^ 400 spectra to achieve essentially the same 
fidelity of calibration as we can achieve with millions of 
SDSS spectra. The right panel shows the photometric 
redshift bias and significance, similar to the left panel. 
For accurate photo-z estimation, we require even fewer 
training spectra: ~ 20 per redshift bin, or a total of 200. 

For upcoming photometric surveys such as DES, we 
can obtain these spectra by first running a crude run 
with an approximate red sequence model. After selecting 
bright central galaxies, these can easily be followed up 
spectroscopically, as they are the most luminous galaxies 
at any redshift. For example, over 85% of the training 
spectra required for DR8 training are brighter than < 
18.5. Thus, our method allows for an incredibly efficient 
use of limited spectroscopic resources to enable science 
in large photometric surveys. 
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Figure 22. Le/t panel, top: Average richness bias as a function of redshift for {5,10,20,40,80} training clusters per redshift bin of width 
±0.025, compared to the richness using the full DR8 training sample. All curves use the same set of 4300 test clusters. Left panel, bottom: 
Error normalized average deviation relative to the baseline. With at least 40 (10) clusters per redshift bin of ±0.025, biases are always 
< 0.3(T (< 0.5cr). Thus, with only 400 well-chosen spectra of the brightest galaxies, we can achieve nearly the same precision as is possible 
with all the SDSS spectra. Right panel, top: Average uncorrected photometric redshift {z\) bias as a function of redshift for {5,10,20,40,80} 
training clusters per redshift bin of width ±0.025, compared to the spectroscopic redshift of the central galaxy, zcG- Right panel, bottom: 
Error normalized average deviation relative to the baseline. With at least 20 clusters per redshift bin of ±0.025 we achieve the same redshift 
performance as is possible with all SDSS spectra. 



D. COMPARISON OF z^cd TO SDSS DR8 
PHOTO-ZS 

We consider two sets of photometric redshift estimates 
available for all of DR8. The first, "z^hoto" , uses an up- 

and the second, 
In this 



( 2007|F 



dated method of Csabai et al. 

uses the method otisheidon et al 
ligh 



( |2012[ ) 

probability cluster member 



section we make use of higr 
galaxies to compare these photometric redshifts to Zred 
at both bright magnitudes (where training galaxies are 
plentiful) and at fainter magnitudes. 

For our "pseudo-spectroscopic" test sample, we start 
with all clusters with A > 5 and a central galaxy with 
spectroscopic redshift zqq. We then select all members 
with Pmom > 0.9. We thus expect a contamination rate 
of up to 10%, although the real rate should be smaller 
than this. By assigning each high probability member to 
the spectroscopic redshift of the central galaxy we can 
leverage the red sequence to obtain spectroscopic quality 
redshifts to much fainter magnitudes than available in 
the SDSS main or LRG spectroscopic samples. 

Figure [23] shows the density map of the photometric 
redshift biases as a function of magnitude for Zrcd, -Zphoto, 
and p{z) for a narrow redshift slice of 0.195 < zqq < 
0.205. For Zj-cd and Zphoto we have assumed a probability 
distribution function (PDF) that is Gaussian with mean 
Zred (zphoto) and width a^,^^^ (o-Zphoto)- For the p{z) val- 
ues we use a spline interpolation to smooth the PDF and 
normalize the area to unity. On the right-hand side are 
projected histograms from the density field. The dot- 
ted red lines show the z^^d distribution for comparison. 
Note that the density plot clearly shows the separation in 
magnitude between central and satellite galaxies. Both 
Zred and Zphoto perform well down to the 0.2L, limit of 
the redMaPPer richness estimation, while the p{z) val- 
ues have a broader distribution at the faint end. These is 
also obvious structure in the photo-z bias as a function 
of magnitude. 



Figure [24] shows the same map for a narrow redshift 
shce of 0.395 < zqg < 0.405. While all the photometric 
redshifts handle the luminous galaxies very well, the ap- 
pear to be slight biases at the faint end in the case of the 
DR8 Zphoto, and a bifurcation of the distribution for the 
p{z) redshifts. The evolution of the bias in the p{z) esti- 
mates is due to a combination of effects. First, the r-band 
magnitude was used as an input to the photo-z estimator. 
For a field galaxy, a fainter magnitude correlates with a 
higher redshift. For cluster galaxies, however, galaxies of 
a wide range of luminosity occupy the same cluster. As a 
result, when using magnitude-based photo-z estimators 
on galaxies in clusters, one should expect an increasing 
bias with magnitude, which is simply a manifestation of 
the intra-cluster luminosity function. The large width of 
the error distributions relative to the other estimators are 
due primarily to the lac k of training set galax ies in that 



( |2012[ ), the main 

< 21.8 galaxy sam- 



^"^ See http://www.sdss3.org/dr8/algorithms/photo-z.php 



range. As discussed in Sheldon et al 
focus was on recovering the full r 
pie. To avoid biases induced by training set selection, 
the authors did not include the most recent BOSS LRG 
samples in that work and deferred LRG-optimizcd p(z) 
estimates to a future paper. It is also worth pointing out 
that, despite the extra width of the error distributions 
obtained when using p{z), the recovered redshift distri- 
butions obtained by summing the p{z) of [Sheldon et al" 
(2012| are still superior to the distributions estimated 
using the DR8 Zphoto or single-point z^ed estimates. 

There are two important take-home messages from this 
comparison. First, the performance of state-of-the-art 
photo-z estimators appears to be sufficiently accurate 
for bright galaxies that we would likely be able to use 
these in the initialization phase of redMaPPer without 
any loss. Second, Zred appears to be at least as good — 
if not better — than what is currently achieved, with 
much smaller spectroscopic training samples. As shown 
in Appendix [C] we can achieve this redshift performance 
with only w 400 of the brightest CG spectra. With the 
technique of assigning the spectroscopic redshift of the 
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Figure 23. Top: Density of total p(%od ~ ^Cg) a function of 
i-band magnitude rrii for all cluster members with Pmcm > 0.9 and 
0.195 < zqq < 0.205 in clusters with A > 5. The right panel shows 
the total PDF at all magnitudes. There is a small bias in ^edi 
though it is constant with magnitude. Middl e: Same as top panel , 
with ^photo calculated with the algorithm of |Csabai et al. (20071. 
The performance is good down to 0.2L*. The rignt panel compares 
the distribution for Zphoto (black line) to ^j-ed (red dotted line). 
Bottom : Same as top panel, with p{z) values from [Sheldon et al.| 
| |2012[ l. While the bright galaxy performance is good, there are 
biases at the faint end and the distribution is significantly wider. 



central galaxies to the members, we effectively increase 
the faint end of our training sample. This is very useful 
for future surveys because of the high cost of obtaining 
spectroscopic redshifts of faint galaxies. 

E. COMPUTING PERFORMANCE 
BENCHMARKS 

The redMaPPer algorithm has been designed to be 
fast, efficient, flexible, and trivially parallelized. As there 
are two parts to running redMaPPer, the calibration and 
cluster-finding stages, we split the performance bench- 
marks into two parts. 

For the calibration phase, the runtime depends on the 
number of training spectra and clusters. For the DR8 
training sample on 2000 deg^, the full calibration takes 
- SOCPUhr on a 3-year old 2.8 GHz AMD Opteron 
8389. Current Intel processors can run the calibration 
roughly twice as fast. For the minimal training sample of 
40 clusters per redshift bin (see Appendix [C]) calibration 
takes - ISCPUhr. 

The cluster-finding stage is designed to be split into 
chunks of arbitrary size on the sky. For these purposes 
we use the Mangle simple pixelization scheme ( Swanson 
et al.|[2008 ), although any pixelization scheme will work. 
As long as the overlap region between pixels is wider than 
twice the largest size of any cluster in the catalog, then 
the percolation of clusters within each cell is guaranteed 
to be unique. For the DR8 catalog, this corresponds to a 
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Figure 24. Same as Figure [231 with members selected from clus- 
ters with 0.395 < ZCG < 0.40S^^ 



border region of 1.5°, corresponding to twice the size of 
a cluster of richness ~ 300 at z = 0.05, given the mask 
radius parameters. In total, running the full DR8 cluster 
finder requires ~ 500 CPU hr including all galaxy mask 
corrections. On a modestly sized compute cluster this 
can be run in much less than one day. 

F. VALIDATING THE CORRECTION C 

In Section [5] we laid out our methodology for correcting 
the richness for survey holes and a magnitude limit that is 
brighter than 0.2L^,. In order to validate the calculation 
of the correction term C described in that section, we 
have taken a subsample of clusters with 0.15 < z < 0.3 
and simulated a more restrictive magnitude limit. We 
have chosen a magnitude limit of rrii < 19.6, which is 
0.2L* at z = 0.2, so that all clusters at higher redshift 
will have their richness corrected according to our for- 
malism. The average correction for the z = 0.3 clusters 
is similar to that for the highest redshift clusters in our 
catalog, so this test will sample the full range of correc- 
tions emplo yed. 

In Figure [25] we show the results of our test. In the 
top panel we show Aig.e vs. A0.2 for all clusters with 
0.2 < z < 0.3, where Aig.g is the richness calculated with 
a magnitude limit of < 19.6 and A0.2 is the standard 
A with a 0.2L* cut. When calculating Aig.g we have re-fit 
the photometric redshift zx to ensure that our compari- 
son is as fair as possible. It is clear in the top panel that 
the correction richness scales with uncorrected richness, 
with some scatter as expected. In the bottom panel we 
show the richness scale value {S — ~ '^scaled /Araw) 
as a function of redshift. The black squares show the me- 
dian estimated value of S derived from Eqn. |15[ while the 
black error bars represent the median error in S as de- 
rived from Eqn. 21 The red diamonds show the median 
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Figure 25. Top: Richness calculated with a < 19.6 cut vs. full 
A richness, for clusters with 0.15 < z < 0.3. The magnitude cut of 
mi < 19.6 is equivalent to 0.2L* at 2 = 0.2, so all cluster a.t z > 0.2 
in this test have S{z) > 1. The corrected richness is consistent 
with the full richness. Bottom: Scale factor 3-3^ vs. photometric 
redshift. Black squares show the scale factor and uncertainty in 
the scale factor estimated in the mi < 19.6 run (shifted slightly for 
clarity). Red diamonds show the measured shift and width. Our 
measured values agree with our model. However at the largest 
corrections we are slightly overestimating the correction term as 
well as the uncertainty in the correction term. 



measured value of 5*, and the red error bars represent the 
observed width in the distribution of S. Our predicted 
correction factor does scale with redshift as expected. 
However, our errors are slightly overestimated for the 
largest corrections. 



G. COMPARING A TO Acoi 

We now explore how the richness estimate used in this 
work, A, compares to the single-color richness Acoi used 
in R12. As detailed in Section |4] the primary difference 
in richness estimators is the replacement of the Gaus- 
sian color filter with a multi-color filter. However, 
we emphasize that there is also a subtle dif feren ce in the 
background model, as described in Section 4.1 That is, 
the filter does not distinguish between galaxies that 
are too red or too blue relative to the model, and while 
the red sequence model is symmetric, the background 
model is not. 

To make our comparisons, we have started with all 
redMaPPer clusters with A > 20. We then calculate \g-r 
and Xr~i using the appropriate color model from the red 
sequence parametrization. Our expectation is that \g~r 
should trace A at low redshift where the dominant signal 
is from the g — r color, and A^-i should trace A at high 
redshift. 



In Figure 26 we show the statistics from comparing 
Xg-r and \r-i to A. In the top panel we show the me- 
dian ratio as a function of redshift. At all redshifts the 
bias between the appropriate Acoi and A is < 10%. In 
the bottom panel we show the median normalized devi- 
ation, which is ~ Icr at low redshift and less so at high 
redshift where the richness errors are much larger due 
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Figure 26. (a) Average ratio between the single-color A^oi vs. 
redshift for both \g-r (blue dotted line) and \r~i (red dashed 
line). As discussed in the text, this offset is likely due to different 
background models. In all cases the difference between the full 
multicolor A and the color appropriate for the redshift range {g — r 
for z < 0.35 and r - i for 2 > 0.35) are less than 10%. (b) Width 
of the Acoi /A distribution as a function of redshift. The scatter is 
< 15% for the appropriate color except for the transition redshift 
of 2 ~ 0.35. (c) Average offset normalized by the richness error. 
Thus, using the single color Xg—r is systematically biased high by 
~ 1(T at low redshift, and A^-i is systematically biased low by 0.2cr 
at high redshift. 



to the magnitude limit. We attribute this bias at low 
redshift to the different background model employed, as 
galaxies that are redder than the red sequence are down- 
weighted in the A model compared to the Acoi model. 
These biases are not large, but they are significant and 
thus show the importance of using the same color model 
and consistent survey data to achieve the best richness 

estimation. 

The middle panel of Figure [26] shows the width of the 
Acoi /A distribution as a function of redshift. The scatter 
is < 15% for the appropriate color except at 2 ~ 0.35, 
where the 4000A break is transitioning from g to r. It 
is in this transition region that a single color richness 
estimator does especially poorly and we have the biggest 
advantage of using a multi-color estimator. 

H. DESCRIPTION OF COLUMNS IN THE DR8 
CLUSTER CATALOG 

The cluster catalog will be available for download when 
the paper is accepted. In this section we describe the tags 
that will be released with the catalog 
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Table 1 

Tags for redMaPPer DR8 Cluster Catalog 



Tag 



Description 



ID 

NAME 

RA 

DEC 

ZLAMBDA 

ZLAMBDA_ERR 

LAMBDA 

LAMBDA_ERR 

Z_SPEC 

OBJID 

IMAG 

IMAG_ERR 

MODEL_MAG_[UGRIZ] 

MODEL_MAGERR_[UGRIZ] 

ILUM 

PZBINS[21] 

PZ[21] 

P_CEN[5] 

RA_CEN[5] 

DEC_CEN[5] 



redMaPPer Cluster Identification Number 

redMaPPer Name 

Right Ascension (J2000 degrees) 

Declination (J2000 degrees) 

Photometric rcdshift z\ 

Gaussian error for z\ 

Richness A 

Error on A 

DR9 Spectroscopic redshift if available (—1 otherwise) 
SDSS DR8 object id 

j-band cmodel magnitude for central galaxy (dereddened) 
error on i-band cmodel magnitude 
model magnitude for central galaxy (dereddened) 
error on model magnitude 

Total membership- weighted i-band luminosity (units of L*) 

Rcdshift points at which p{z) is evaluated 

p{z) values at redshift points 

Centering probability -Pcen for up to 5 centrals 

Right Ascension for up to 5 centrals (—100 if Pcen < 0.01) 

Declination for up to 5 centrals (—100 if Pcen < 0.01) 



Table 2 

Tags for redMaPPer DR8 Member Catalog 



Tag 



Description 



ID 
RA 
DEC 
R 

PMEM 

IMAG 

IMAG_ERR 

MODEL_MAG_[UGRIZ] 

MODEL_MAGERR_[UGRIZ] 

Z_SPEC 

OBJID 



redMaPPer Cluster Identification Number 
Right Ascension (J2000 degrees) 
Declination (J2000 degrees) 
Distance from cluster center {h^^ Mpc) 
Membership probability pmem 

i-band cmodel magnitude for central galaxy (dereddened) 
error on i-band cmodel magnitude 
model magnitude for central galaxy (dereddened) 
error on model magnitude 

DR9 Spectroscopic redshift if available (—1 otherwise) 
SDSS DR8 object id 
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